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(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: BIO MERIEt 

( B ) STREET : CHEMIN/ DE . L ' ORME 

(C) CITY: MARCY l/eTOILE 
(E) COUNTRY: FRANCE 

10 (F) POSTAL CODE^ 69280 

(ii) TITLE OF INVENTION: NUCLEIC MATERIAL OF THE 
_ ENDOGENOUS RETROVIRAL GENOMIC TYPE, ASSOCIATED WITH AN 
" AUTOIMMUNE DISEASE AND/pR WITH PREGNANCY DISORDERS; USE 
=p 15 AS MARKER 

p (iii) NUMBER OF SEQUENCES: 35 

I / 

(iv) COMPUTER /READABLE FORM: 

M 20 (A) MEDIUM TYPE: Floppy disk 

m / 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, 
Version #1.30 (EPO) 

/ 

(2) INFORMATION FOR SEQ ID NO: 1: 

/ 

(i) SEQUENCE CHARACTERISTICS: 
(pI) LENGTH: 1321 base pairs 

30 (B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
'(D) TOPOLOGY: linear 

(ii) / MOLECULE TYPE: mRNA (as DNA) 

35 

(iii) HYPOTHETICAL: NO 



REPLACEMENT SHEET (RULE 26) 




pa 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CAACAATCGG GATATAAACC CAGGCATTCG AGCTGGCAAC AGCAGCCCCC CTTTGGGTCC 60 

CTTCCCTTTG TATGGCAGCT CTTTTCATGC TATTTCACTC TATTAAATCT TGCAACTGCA 120 

CTCTTCTGGT CCATGTTTCT TACGGCTCGA GCTGAGCTTT TGCTCACCGT CCACCACTGC 160 

TGTTTGCCAC CACCGCAGAC CTGCCG CTG A CTCCCATCCC TCTGGATCCT GCAGGGTGTC 240 

CGCTGTGCTC CTGATCCAGC GAAGCGCCCA TTGCCGCTCC CAATTGGGCT AAAGG CTTGC 300 

CATTGTTCCT GCACGGCTAA GTGCCTGGGT TTGTTCTAAT TGAGCTCAAC ACTAGTCACT 360 

GGGTTCCATC GTTCTCTTCT GTGACCCACG GCTTCTAATA GAACTATAAC ACTTACCACA 420 

TGGCCCAAGA TTCCATTCCT TGGAATCCGT GAGGCCAAGA ACTCCAGGTC AGAGAATACG 480 

AAGCTTGCCA CCATCTTGGA AGCGG CCTGC TACCATCTTG GAAGTGGTTC ACCACCATCT 540 

TGGGAGCTCT GTGAGCAAGG ACCCCCCGGT AACATTTTGG CAACCACGAA CGGACATCCA 600 

AAGTGATGGG AAACGTTCCC CGCAAGACAA AAACGCCCCT AAGACGTATT CTGGAAAATT 660 

GCGAACAATT TGACCCTCAG ACACTAAGAA AC AAA CG ACT TATATTCTTC TGCAGTGCCG 720 

CCTGGCACTC CTGAGGGAAG TATAAATTAT AACACCATCT TACAGCTAGA CCTCTTTTGT 780 

AGAAAAGGCA AATGGAGTGA AGTGCCATAA GTACAAACTT TCTTTTCATT AAGAGACAAC 840 

TCACAATTAT CTAAAAAGTG TGATTTATGC CCTACAGGAA GCCTTCAGAG TCTACCTCCC 900 

TATCCCAGCA TCCCCGACTC CTTCCCCACT TAATAAGGAC CCCCCTTCAA CCCAAATGGT 960 

CCAAAAGGAG ATAGACAAAA GGGTAAACAG TGAACCAAAG AGTGCCAATA TTCCCCAATT 1020 



REPLACEMENT SHEET (RULE 26) 



ATCACCCCTC CAAGCAGTCG GAGGAAGAGA ATTCGGCCCA G CC AG AG TG C ATGTCCCTTT 1080 
TTCTCTCCCA GACTTAAAGC AAATAAAAAC AGACTTAGGT AAATTCTCAG ATAACCCTGA 114 O 
TGGCTATATT GGTGTTTTAC AAGGGTTAGG ACAATTCTTT GATCTGACAT CG AG AG AT AT 1200 
ATATGTCACT GCTAAATCAG ACACTAACCC CAAATGAGAG AAGTGCCACC ATAACTGCAG 1260 
CCTGAGAGTT TGGCGATCTC TGGTATCTCA GTCAGGTCAA TGATAGGATG ACAACAGAGG 1320 
A 1321 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 93 8 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mRNA (as DNA) 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CAACGACGGA CATCCAAAGT GATGGGAAAC GTTCCCCGCA AGACAAAAAC GCCCCTAAGA 60 
CGTATTCTGG AGAATTGGGA CCAATTTGAC CCTCAGACAC TAAGAAAGAA ACGACTTATA 120 
TTCTTCTGCA GTGCCGCCTG GCACTCCTGA GGGAAGTATA AATTATAACA CCATCTTACA 180 
GCTAGACTTC TTTTGTAGAA AAGGCAAATG GAGTGAAGTG CCATAAGTAC AAACTTTCTT 240 



REPLACEMENT SHEET (RULE 26) 



TTCATTAAGA GACAACTCAC AATTATGTAA 
TCAGAGTCTA CCTCCCTATC CCACCATCCC 
CTTCAACCCA AATGGTCCAA AAGGAGATAG 
CCAATATTCC CCAATTATGA CCCCTCCCAA 
GAGTGCATGT GCTTTTTCTT CTCCCAGACT 
TCTCAGATAA TCCTGATGGC TATATTGATG 
TGACATGGAG AGATATAATG TCACTGCTAA 
CCACCATAAC TGCAGCCTGA GAGTTTGGCG 
GGATGACAAC A GAG G AAA G A GATGATCCCC 
CTCATTGGGG ACACAGAAAT CAGTAACATG 
TGTGTGCTAC AAGGACTAAG GAAAACTACG 
CACCATAACA CAGGGGAAGG GAAGAAAATC 
CATTGAGGAA GCGTGCCTCT CTGTCACCTG 
GTAACTTTAT CACTCAGTCA GCTGCAGACA 
CCCGGAGCAA AACTTAGAAA CCCTATTGAA 
TCAGGAGGAG CAGGCGGAAC AGGACAAACG 
TGACCCTCAG GCAAGTGGAC TTTCGACGCT 
TGCCTAATAG GGCTTCCTTC CAGTGCGGTC 



AAAGTGTGAT TTATGCCCTA CAGGAAGCCT 300 
CGACTCCTTC CCCAACTAAT AAGGACCCCC 360 
ACAAAAGGGT AAACAGTGAA CCAAAGAGTC 420 
C CAGTGGG AG GAAGAGATTC GGCCCAGCCA 480 
TAAAGCAAAT AAAAACAGAC TTAGGTAAAT 540 
TTTTACAAGG GTTAGGACAA TTCTTTGATC 600 
ATCAGACACT AACCCCAAAT GAGAGAAGTG 660 
ATCTCTGGTA TCTCAGTCAG G TCAATG AT A 720 
ACAGCCAGCA AGCAGTTCCC AGTCTASACC 780 
GGAGATTGGT GCTGCAGACA TTTGCTAACT B40 
AAGAAAATCT ACGAATTACT CAATGATGTC 900 
CTACTGCCTT TCTGGAGAGA CTAAGGGAGG 960 
ACTCTTCTG A AGGCCAACTA ATCTTAAAGC 1020 
TTAGAAAAAA CTTCAAAAGT CTGCCGTAGG 1080 
CTTGGCAACY TCGGTTTTTT ATAATAGAGA 1140 
GGATTAAAAA AAAGGCCACC GCTTTAGTCA 1200 
CTGGAAAAGG GAAAAGCTGG GCAAATTGAA 1260 
TACAAGGACA CTTTAAAAAA CATTGTCCAA 1320 



REPLACEMENT SHEET (RULE 26) 



GTAGAAGTAA GCCGCCCCTT CGTCCATGCC 
ACTGCCCCAG GGGACAAAGG TCTTTTCAGT 
ACCACTGAGG GTGCCTGCGG CAAGCGCCAT 
TATGCTTGAC CATTGAGGGC CAGGAAGGTT 
TCTTACTCTT CTGTCCCGGA CAACTGTCCT 
AGACGGGCAG TCACTAGATA CTTCTCCCAG 
TTTTCACATG CTTTTCTAAT TATGCTTGAA 
CTAGCAAAAG CAGGGGCCAT TATACACCTG 
CCCCTGCTTG AGGAAGGAAT TAATCCTGAA 
GCAAAGAATG CCCGTCCTGT TCAAGTTAAA 
AGGCAGTACC CCCTCAGACC CAAGGCCCAA 
AAAGCCCAAG GCTTAGTAAA ACCATGCATA 
GGAGGCACAG AAACCCAGTG G AC AGTGG AG 
GAGGCCGTTG TCCTTTTATA CCCAGCTGTA 
CCAGAGGAAG CAGAGTGGTT TACACTCCTG 
GTACATCCTG ACTCTCAATT CTTGTTTGCC 
CTCACCTGGA CTGTTTTACC CCAAGGGTTC 
TTAGCCCAAG ACTTG AG CCA AT C CTC AT AC 



CCTTATTTCA AGGGAATCAC TGGAAGGCCC 1380 
CAGAAGCCAC TAACCAGATG ATCCAGCAGC 1440 
CCCATGCCAT CACCCTCACA GACCCCTGCG 1500 
GTCTCCTGGA CACTGGTGCG GTCTTCTTAG 1560 
CCAGATCTGT CACTATCTGA GGGGGTCCTA 1620 
CCACTAAGTT ATGACTGGGG AGCTTTATTC 1680 
AGCCCCACTA CCTTGTTAGG GAGAGACATT 1740 
AACATAGGAG AAGGAACACC CGTTTGTTGT 1800 
GTCTGGGCAA CAGAAGGACA ATATGGACGA 1860 
CTAAAGGATT CCACTTCCTT TCCCTACCAA 1920 
CAAGGATTCC AAAAGATTGT TAAGGACTTA 1980 
ACTCCCTGCA GTAATTCCGT AGTGGATTGA 2040 
CCTTAGTCCA AGATCTCAGG ATTATCAATG 2100 
CCTAGCCCTT ATACTGTGCT TTCCCAAATA 2160 
GACCTTAAGG ATGCCTTCTT CTGCATCCCT 2220 
TTTGAAGATA CTTCAAACCC AACATCTCAA 2280 
AGGGATAGCC CCCATCTATT TGGCCAGGCA 2340 
CTGGACACTT GTCCTTCGGT AGGTGGATGA 2400 



REPLACEMENT 



SHEET (RULE 26) 



TTTACTTTTG GCCGCCCATT CAGAAACCTT GTGCCATCAA GCCACCCAAG CCCTCTTCAA 2460 
TTTCCTCGCT ACCTGTGGCT ACATGGTTTC CAAACCAAAG GCTCAACTCT GCTCACAGCA 2520 
GGTTACTTAG GGCTAAAATT ATCCAAAGGC ACCAGGGCCC TCAGTGAGGA ACACATCCAG 2580 
CCTATACTGG CTTATCCTCA TCCCAAAACC CTAAAGCAAC TAAGGGGATT CCTTGGCGTA 2640 
ATAGGTTTCT CCCGAAAATG GATTCCCAGG TTTGCCGAAA TAGCCAGGTC ATTAAATACA 2700 
CTAATTAAGG AAACTCAGAA AGCCAATACC CATTTAGTAA GATGGACAAC TGAAGTAGAA 2760 
GTGGCTTTCC AGGCCCTAAC CCAAGCCCCA GTGTTAAGTT TGCCAACAGG GCAAGACTTT 2820 
TCTTCATATG TCACAGAAAA AACAGGAATA GCTCTAGGAG TCCTTACACA GATCCGAGGG 2880 
ATGAGCTTGC AACCTGTGGC GTACCTGACT AAGGAAATTG ATGTAGTGGC AAAGGGTT 2938 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1422 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mRNA (as DNA) 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TCAGGGATAG CCCCCATCTA TTTGGCCAGG CATTAGCCCA AGACTTGAGT CAGTTATCAT 60 
ACCTGGACAC TCTTGTCCTT CAGTATGTGG ATGATTTACT TTTAGCTGCC TGTTCAGAAA 120 



REPLACEMENT SHEET (RULE 26) 



CCTTCTGCCA TCAAGCCACC CAAGCACTCT 
TTTCCAAAGA GAAGCTCAGC TCTGCTCACA 
CAAAGGCACC AAGGCCCTCA GTGAGGAATG 
CAAAACCCTA AAGCAACTAA GAGACTTCCT 
TCCCCAGGTA TGGCAAAATA GCCAGGCCAT 
CCAATACCCA TTTAATAAGA TGGATACCTG 
AAGGCCTTAA ACCCAAGTCC CAGTGTTAAG 
CATCACAGAA AAAAACAGAA AC AG CTCTGG 
TGCAACCCAT GGCATACCTG AGTAAGGAAA 
GTTTATGGGT AGTGGTCGCA GTAGCAGTTG 
GGAGAGATCT TACTGTGTGG ACATCTCATG 
ACTTGTGGCT GTCAGACAAC CGTTTACTTA 
TGCTGCAACT GTGCACTTGT GCAACTCTTA 
ATAGAATATA ACTGTCAACA AATAATTTCT 
GAAGTTCCCT TGACTGATCC TGACCTTCAA 
AAAAGGACTT CAAAAGCGGG GTATGCAGTG 
CCCTCACTCC AGGAACTAGT GCTTAGCTGG 
AATTAGGAGA AGGAAAAAGC GTAAATATAT 



TAAATTTCCT CGCCACCTGT GGCTACAAGG 180 

GCAGGTTAAA TACTTAGGAC TAAGATTATC 240 

TATCCAGCCT ATACTGGCTT ATCCTCATCT 300 

TGGCATAACA GGCTTCTGCC GAATATGGAT 360 

TATATACAGT AATTAAGGAA ACTCAGAAAG 420 

AAGCCAAAGT GGCTTTCCAG GCCCCTAAAG 480 

CTTG CCAACG GGG CAAG ACT TTTCTTTATA 540 

GAGTCCTTAC ACAGGTCCAA GGGACGAGCT 600 

CTGATCTAGT GGCAAAGGGT TGGCTTCATT 660 

TAGTATCTGA AGCAGTTAAA ATAATACAGG 720 

AGGTGAACAG CATACTCACT GCTAAAGGAG 780 

AATATCAGGC TCTATTACTT GAAAGGCCAG 840 

ACCCAGTCNC ATTTCTTCCA GACAATGAAG 900 

CAAACCTATG CCACTCGAGG GGACCTTCTA 960 

CTTG T AT A CT GATGGAAGTT CCTTTGTAGA 1020 

GTCAGTGATA ATGGAATATT TGAAAGTATC 1080 

CAGAACTAAT AGCCTTCATT GGGG C ACT AG 1140 

ATACAGACTC TCAGTATGCT CACCTAGTCN 1200 



REPLACEMENT 



SHEET (RULE 26) 



TCCATGCCCA TGACGCAATA TGCAGAGAAA GGGAATTCCT AACTTCCGAG GGAACACCTA 1260 
TCACACATCA CGAACCCATT AG GAG ATT AT TACTGGCAGT ACAGAAACCT AAAGAGGTGG 1320 
AAGTCTTACA CTGCTGGGGT CATCAGAAAG GAAAGAAAAG GGAAATAGAA GCGAATTGCC 1380 
AAGCAGATAT TGAAGCAAAA AGAGCTGCAA GGCAGGACCC TC 1422 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2006 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mRNA (as DNA) 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATGCAGTGGT CAGTGATAAT GGAATACTTG AAAGTAATCC CCTCACTCCA GGAACTAGTG 60 

CTCAGCTAGC AGAACTAATA GCCCTCACTT GGGCACTAGA ATTAGGAGAA GAAAAAAGGG 120 

CAAATATATA TACAGACTCT AAATATCCTT ACCTAGTCCT CCATCCCCAT GCAGCAATAT 180 

GGAAAGAAAG GGAATTCCTA ACTTCTGAGA GAACACCTAT CAAACATCAG GAAGCCATTA 240 

GGAAATTATT ATTGGCTGTA CAGAAACCTA AAGAGGTGGC AG T CTT AC AC TGCCGGGGTC 300 

ATCANAAAGG AAAGGAAAGG GAAAATACTT TTGCCTGCAA CTATCCAATG GAAATTACTT 360 
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AAAACCCTTC ATCAAACCTT TCACTTAGGC ATCGATAGCA CCCATCAAAT GGCCAAATCA 420 

TTATTTACTC GACCAGGCCT TTTCAAAACT ATCAAGCAAA TATTCAGGCC CTGTGAATTG 480 

TGCCAAAAAA ATAATCCCCT GCCTCATCGC C AAG CTCCTT CAGGAAAACA AAAAACAGGC 540 

CATTACCCTG AAAAAAACTG GCAACTGATT TTACCCACAA GCCCAAACCT CAGGGATTTC 600 

AGTATCTACT AGTCTGGGTA AATACTTTCA CGGGTTGGGC AAAGGCCTTC CCCTGTAGGA 660 

CAGAAAAGGC CCAAGAGGTA ATAAAGGCAC TAGTTCATGA AATAATTCCC AG ATTCGG AC 720 

TTCCCOGAGG CTTACAGAGT GACAATAGCC CTGCTTTCCA GGCCACAGTA ACCCAGGGAG 780 

TATCCCAGGC GTTACGTATA CG AT A TC ACT TACACTGCGC CTGAAGGCCA CAGTCCTCAG 840 

GGAAGGTCGA GAAAATGAAT GAAATACTCA AAGGACATCT AAAAAAGCAA ACCCAGGAAA 900 

CCCACCTCAC ATGGCCTGCT CTGTTG CCTA TAGCCTTAAA AAGAATCTGC AACTTTCCCC 960 

AAAAAGCAGG ACTTAGCCCA TACGAAATGC TGTATGGAAG GCCCTTCATA ACCAATGACC 1020 

TTGTGCTTGA CCCAAGACAG CCAACTTAGT TGCAGACATC ACCTCCTTAG CCAAATATCA 1080 

ACAAGTT C TT AAAACATTAC AAG G AA CCTA TCCCTGAGAA GAGGGAAAAG AACTATTCCA 1140 

CCCTTGTGAC ATGGTATTAG TCAAGTCCCT TCTCTCTAAT TCCCCATCCC TAGATACATC 1200 

CTGGGAAGGA CCCTACCCAG TCATT T TATT TACCCCAACT GCGGTTAAAG TGGCTGGAGT 1260 

GGTCTTGGAT ACATCACACT TGACTCAAAT CCTGGATACT GCCAAAGGAA CCTGAAAATC 1320 
GAG G AG AC AA CGCTAGCTAT TCCTGTGAAC CTCTAGAGGA TTTGCGCCTG CTCTTCAAAC 1380 
AACAACCAGG AGGAAAGTAA CTAAAATCAT AAATCCCCCA TGGCCCTCCC TTATCATATT 1440 
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TTTCTCTTTA CTGTTCTTTT ACCCTCTTTC 
ATGACCAGTA CCTCCCCTTA CCAAGAGTTT 
GATGCCCCAT CGTATAGGAG TCTTTCTAAG 
ATGCCCCGCA ACTGCTATCA CTCTGCCACT 
ACAGGAAAAA TGATTAATCC TAGTTGTCCT 
TACTTCACCC AAACTGGTAT GTCTGATGGG 
CATGTAAAAG AAGTAATCTC CCAACTCACC 
GACTAGATCT CTCAAAACTA CATGAAACCC 
TTAATACCAC CCTCACTGGG CTCCATGAGG 
TATGCCTCCC CCTGAACTTC AAGCCA 



ACTCTCACTG CACCCCCTCC ATGCCGCTGT 1500 
CTATGGAGAA TGCAGCGTCC CGGAAATATT 1560 
GGAACCCCCA CCTTCACTGC CCACACCCAT 1620 
CTTTGCATGC ATGCAAATAC TCATTATTGG 1680 
GGAGGACTTG GAGTCACTGT CTGTTGGACT 1740 
GGTGGAGTTC AAGATCAGGC AAGAGAAAAA 1800 
CGGGTACATG GCACCTCTAG CCCTACAAAG I860 
TCCGTACCCA TACTCGCCTG GTAAG CCT AT 1920 
TCTCGGCCCA AAACCCTACT AACTGTTGGA 19 BO 

2006 



(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 8 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mRNA (as DNA) 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ACTCCACTCT TCTGGTCCAT CTTTCTTACG GCTCGAGCTG AGCTTTTGCT CACCGTCCAC 60 
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- 11 - 

CACTGCTGTT TOCCACCACC GCANACCTGC CGCTGACTCC CATCCCTCTG GATCCTGCAG 120 

GGTGTCCG CT GTGCTCCTGA TCCAGCGAGG CGCCCATTCC CGCTCCCAAT TGGGCTAAAG 180 

CCTTGCCATT GTNCCTGCAC GGCTAAGTCC CTGGGTTTGT TCTAATTCAC CTCAACACTA 240 

NTCACTGGGT TCCATGGTTC TCTTCTGTGA CCCACGGCTT CTAATAGAAC TATAACACTT 300 

ACCACATGGC CCAAGATTCC ATTCCTTGGA ATCCGTGAGG GCAAGAACTC C AG G T C AG AG 360 

AATACGAGGC TTGCCACCAT CTTGGAAGCG GCCTGCTACC ATCTTGGAAG TGGTTCACCA 420 

CCATCTTGGG AGCTCTGTGA GCAAGGACCC CCCGGTAACA TTTTGGCAAC CACGAACGGA 480 

CATCCAAAGT GATACATCCT GGGAAGGACC CTACCCAGTC ATTTTATCTA CCCCAACTGC 540 

GGTTAAAGTG GCTGGAGTGG AGTCTTGG AT ACATCACACT TGAGTCAAAT CCTGGATACT 600 



Q GCCAAAGGAA CCTGAAAATC CAGGAGACAA CGCTAGCTAT TCCTGTCAAC CTCTAGAGGA 660 

si 

TTTGCGCCTG CTCTTCAAAC AACAACCAGG AGGAAAGTAA CTAAAATCAT AAATCCCCAT 720 

RJ GGCCCTCCCT TATCATATTT TTCTCTTTAC TGTTGTTTCA CCCTCTTTCA CTCTCACTGC 780 

frj ACCCCCTCCA TGCCGCTGTA TGACCAGTAG CTCCCCTTAC CAAGAGTTTC TATGGAGAAT 840 

seer 

tf3 

GCAGCGTCCC GGAAATATTG ATGCCCCATC GTATAGGAGT CTTTGTAAGG GAACCCCCAC 900 

CTTCACTGCC CACACCCATA TGCCCCGCAA CTGCTATCAC TCTGCCACTC TTTGCATGCA 960 

TGCAAATACT CATTATTGGA CAGGAAAAAT GATTAATCCT AGTTGTCCTG GAGGACTTGG 1020 

AGTCACTGTC TGTTGGACTT ACTTCACCCA AACTGG TATG TCTGATGGGG GTGGAGTTCA 1080 

AGATCAGGCA AGAGAAAAAC ATGT AAAAG A AGTAATCTCC CAACTCACCC GGGTACATGG 1140 



REPLACEMENT 
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CACCTCTAGC CCCTACAAAG GACTAGATCT CTCAAAACTA CATGAAACCC TCCGTACCCA 1200 
TACTCGCCTG GTAAGCCTAT TTAATACCAC CCTCACTGGG CTCCATGAGC TCTCGGCCCA 1260 
AAACCCTACT AACTCTTGGA TATGCCTCCC CCTGAACTTC AGGCCATATG TTTCAATCCC 1320 
TGTACCTGAA CAATGGAACA ACTTCAGCAC AGAAATAAAC ACCACTTCCG TTTTAGTAGG 1380 
ACCTCTTGTT TCCAATCTGG AAATAACCCA TACCTCAAAC CTCACCTGTG TAAAATTTAG 1440 
CAATACTACA TACACAACCA A CTCCCAATG CATCAGGTGG GTAACTCCTC CCACACAAAT 1500 
AGTCTGCCTA CCCTCAGGAA TATTTTTTGT CTGTGGTACC TCAGCCTATC GTTGTTTGAA 1560 
TGGCTCTTCA GAATCTATGT GCTTCCTCTC ATTCTT AG TG CCCCCTATGG CCATCTACAC 1620 
TGAACAAGAT TTATACAGTT ATGTCATATC TAAGCCCCGC AACAAAAGAG TACCCATTCT 1680 
TCCTTTTGTT ATAGGAGCAG GAGTGCTAGG TGCACTAGGT ACTGGCATTG GCGGTATCAC 1740 
AACCTCTACT CAGTTCTACT ACAAACTATC TCAAGAACTA AATGGGGACA TGGAACGGGT 1800 
CGCCGACTCC CTGGTCACCT TGCAAGATCA ACTTAACTCC CTAGCAGCAG TAGTCCTTCA 1860 
AAATCGAAGA GCTTTAGACT TGCTAACCGC TGAAAGAGGG GGAACCTGTT TATTTTTAGG 1920 
GGAAGAATGC TGTTATTATG TTAATCAA 1948 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 6 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mRNA (as DNA) 

(iii) HYPOTHETICAL: NO 

REPLACEMENT SHEET (RULE 26) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CCATGGCCAT CTACACTGAA CAAGATTTAT ACAGTTATGT CATATCTAAG CCCCGCAACA 60 

AAAGAGTACC CATTCTTCCT TTTGTTATAG GAGCAGGAGT GCTAGGTGCA CTAGGTACTG 120 

GCATTGGCGG TATCACAACC TCTACTCAGT TCTACTACAA ACT AT CTC AA GAACTAAATG 180 

GGGACATGGA ACGGGTCGCC GACTCCCTGG TCACCTTGCA AGATCAACTT AACTCCCTAG 240 

CAGCAGTAGT CCTTCAAAAT CGAAGAGCTT TAGACTCGCT AACCGCTGAA AGAGGGGGAA 300 

CCTGTTTATT TTTAGGGGAA GAATGCTCTT ATTATGTTAA TCAATCCGGA ATCGTCACTG 360 

AGAAAGTTAA AGAAATTCGA GATCGAATAC AACGTAGAGC AGAAGAGCTT CCAAACACTG 420 

GACCCTGGCG CCTCCTCAGC CAATGGATGC CCTGGATTCT CCCCTTCTTA GGACCTCTAG 480 

CAGCTATAAT ATTGCTACTC CTCTTTGGAC CCTGTATCTT TAACCTCCTT GTTAACTTTG 540 

TCTCTTCCAG AATCGAAGCT GTAAAACTAC AAATGGAGCC CAAGATGCAG TCCAAGACTA 600 

AGATCTACCG CAGACCCCTG GACCGGCCTG CTAGCCCACG ATCTCATGTT AATGACATCA 660 

AAGGCACCCC TCCTGAGGAA ATCTCAGCTC CACAACCTCT ACTACGCCCC AATTCAGCAG 720 

GAAGCAGTTA GAGCGGTCGT CGGCCAACCT CCCCAACAGC ACTTAGGTTT TCCTGTTGAG 780 

ATGGGGGACT GAGAGACAGG ACTAGCTGGA TTTCCTAGGC TGACTAAGAA TCCCTAAGCC 840 

TAG CTGG G AA GGTGACCACA TCCACCTTTA AACACGGGGC TTGCAACTTA GTTCACACCT 900 

G AC CAATC AG AGAGCTCACT AAAATGCTAA TTAGGCAAAG ACAGGAGGTA AAGAAATAGC 960 

CAATCATCTA TTGCATGAGA GCACAGCAGG AGGGACAATG ATCGGGATAT AAACCCAAGT 1020 

CTTCGAGCCG GCAACGGCAA CCCCCTTTGG GTCCCCTCCC TTTGTATGGG ACCTCTGTTT 1080 

TCATGCTATT TCACTCTATT AAATCTTGCA GCTGCGAAAA AAAAAAAAAA AAAAAA 1136 
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I 



i : 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2782 base pairs 
5 (B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE: mRNA (as DNA) 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ATGGGAGCTG TTTTCATGCT ATTTCACTCT ATTAAATCTT GCAACTGCAC TCTTCTGCTC 60 



CP CATGTTTCTT ACGGCTCGAG CTGAGCTTTT GCTCACCGTC CACCACTGCT GTTTGCCACC 120 

Q 

w 

j*> ACCGCAGACC TGCCGCTGAC TCCCATCCCT CTGGATCCTG CAGGGTGTCC GCTGTGCTCC 180 

if] TGATCCAGOG AAGCGCCCAT TGCCGCTCCC A ATTGGG CT A AAGGCTTGCC ATTGTTCCTG 240 

ff\ CACGGCTAAG TGCCTGGGTT TGTTCTAATT GAGCTGAACA CTAGTCACTG GGTTCCATGG 300 

d 

^ 15 TTCTCTTCTG TGACCCACGG CTTCTAATAG AACTATAACA CTTACCACAT GGCCCAAGAT 360 
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TCCATTCCTT GGAATCCGTG AGGCCAACGA ACTCCAGGTC AGAGAATACG AAGCTTGCCA 420 

CCATCTTGGA AGCGGCCTGC TACCATCTTG GAAGTGGTTC ACCACCATCT TGGGAGCTCT 480 

GTGACCAACC ACCCCCCGGT G AC ATTTTG G CGACCACCAA CGGACATCCC AAGTGATACA 540 

TCCTGGGAAG GACCCTACCC AGTCATTTTA TCTACCCCAA CTGCGCTTAA AGTCCCTGGA 600 

GTGGAGTCTT GGATACATCA CACTTGAGTC AAATCCTGGA TACTGCCAAA GGAACCTGAA 660 

AATCCAGGAG ACAACGCTAG CTATTCCTGT GAACCTCTAG AGGATTTGCG CCTGCTCTTC 720 

AAACAACAAC CAGGAGGAAA GTAACTAAAA TCATAAATCC CCATGGGCCT CCCTTATCAT 780 

ATTTTTCTCT GTAGTGTTCT TTCACCCTGT TTCACTCTCA CTGCACCCCC TCCATGCCGC 840 

TCTATGACCA GTAGCTCCCC TCACCCAGAG TTTCTATGGA G AATGCAG CG TCCCGGAAAT 900 

ATTGATGCCC CATCG TAT AG GAGTCTTTCT AAGGGAACCC CCACCTTCAC TGCCCACACC 960 

CATATGCCCC GCAACTGCTA TCACTCTGCC ACTCTTTGCA TGCATCCAAA TACTCATTAT 1020 

TGGACAGGAA AAATGATTAA TCCTAGTTGT CCTGGAGGAC TTCGAGTCAC TGTCTGTTGG 1080 

ACTTACTTCA CCCAAACTGG TATGTCTGAT GGGCGTCGAG TTCAAGATCA GGCAAGAGAA 1140 

AAACATGTAA AAGAAGTAAT CTCCCAACTC ACCGGGGTAC ATGGCACCTC TAGCCCCTAC 1200 

AAAGGACTAG ATCTCTCAAA ACTACATGAA ACCCTCCGTA CCCATACTCG CCTGCTAAGC 1260 

CTATTTAATA CCACCCTCAC TGGGCTCCAT GAGGTCTCGG CCCAAAACCC TACTAACTGT 1320 

TGGATATGCC TCCCCCTGAA CTTCAGGCCA TATGTTTCAA TCCCTCTACC TGAACAATGG 1380 

AACAACTTCA GCACAGAAAT AAACACCACT TCCGTTTTAG TAGGACCTCT TGTTTCCAAT 1440 
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GTGGAAATAA CCCATACCTC AAACCTCACC 
ACCAACTCCC AATGCATCAG GTGGGTAACT 
GGAATATTTT TTGTC TGTG G TACCTCAGCC 
ATGTGCTTCC TCTCATTCTT AGTGCCCCCT 
AGTTATGTCA TATCTAAGCC CCGCAACAAA 
GCAGGAGTGC TAGGTGCACT ACGTACTGGC 
TACTACAAAC TATCTCAAGA ACTAAATGGG 
ACCTTGCAAG ATCAACTTAA CTCCCTAGCA 
GACTTGCTAA CCGCTGAGAG AGGGGGAACC 
TATGTTAATC AATCCGGAAT CGTCACTGAG 
CGTATAG CAG AG GAG CT T CG AAACACTGGA 
TGGATTCTCC CCTTCTTAGG ACCTCTAGCA 
TGTATCTTTG ACCTCCTTGT TAACTTTGTC 
ATGGAGCCCA AGATGCAGTC CAAGACTAAG 
AGCCCAOGAT CTGATGTTAA TGACATCAAA 
CAACCTCTAC TACGCCCCAA TTCAGCAGGA 
CCAACAGCAC TTAGGTTTTC CTGTTGAGAT 
TCCTAGGCTG ACTAAGAATC CTTAAGCCTA 



TGTGTAAAAT TTAGCAATAC TACATACACA 1500 
CCTCCCACAC AAATAGTCTG CCTACCCTCA 1560 
TATCGTTGTT TGAATGGCTC TTCAGAATCT 1620 
ATGACCATCT ACACTGAACA AGATTTATAC 1680 
AGAGTACCCA TTCTTCCTTT TGTTATAGGA 1740 
ATTGG CGGTA TCACAACCTC TACTCAGTTC 1800 
GACATGGAAC GGGTCGCCGA CTCCCTGGTC 1860 
GCAGTAGTCC TTCGAAATCG AAGAGCTTTA 1920 
TGTTTATTTT TAGGGGAAGA ATGCTCTTAT 1980 
AAAGTTGAAG AAATTCCAGA TCGAATACAA 2040 
CCCTGGGGCC TCCTCAGCCG ATGGATGCCC 2100 
GCTATAATAT TGCTACTCCT CTTTGGACCC 2160 
TCTTCCAGAA TCGAAGCTGT GAAACTACAA 2220 
ATCTACCGCA GACCCCTGGA CCGGCCTGCT 2280 
GGCACCCCTC CTGAGGAAAT CTCAGCTGCA 2340 
AG CAG TT AG A GCGGTGGTCG GCCAACCTCC 2400 
GGGGGACTGA GAGACAGGAC TAGCTGGATT 2460 
GGTGGGAAGC TGACCACATC C A CCTTT AAA 2520 
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CACGGCCCTT GCAACTTAGC TCACACCTGA CCAATCAGAG AGCTCACTAA AATGCTAATT 2580 
AGGCAAAGAC AGGAGGTAAA GAAATAGCCA ATCATTTATT GCCTGAGAGC ACAGGAGGAG 2640 
GGACAATGAT CGGGATATAA ACCCAAGTTT TCGAGCCGGC AACGGCAACC CCCTTTGGGT 2700 
CCCCTCCCTT TGTATGGGAG CTCTG TTTTC ATGCTATTTC ACTCTATTAA ATCTTGCAAC 2760 
TGCAAAAAAA AAAAAAAAAA A A 2782 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 666 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mRNA (as DNA) 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
TGTCCGCTGT GCTCCTGATC CAGCGAGGCG CCCATTGCCG CTCCCAATTG GGCTAAAGGC 60 
TTGCCATTGT TCCTGCACGG CTAAGTGCCT GGGTTTGTTC TAATTGAGCT GAACACTANT 120 
CACTGGGTTC CATGGTTCTC TTCTGTGACC CACGGCTTCT AATATAACTA TAACACTTAC 180 
CACATGGCCC AAGATTCCAT TCCTTGG AAT CCGTGAGGCC AAGAACTCCA GGTCAGAGAA 240 
TACGAGGCTT GCCACCATCT TGGAAGCGGC CTGCTACCAT CTTGGAAGTG GTTCACCACC 300 
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ATCTTGGGAG CTCTGTGAGC AAGGACCCCC CGGTAACATT TTGGCAACCA CGAACGGACA 360 

TCCAAAGTGA ATCGAAGCTG TAAAACTACA AATGGAGCCC AAGATCCAGT CCAAGACTAA 420 

GATCTACCGC AGACCCCTGG ACCGGCCTGC TAG CCCACG A TCTGATGTTA ATGACATCAA 480 

AGGCACCCCT CCTGAGGAAA TCTCAGCTGC ACAACCTCTA CTACGCCCCA ATTCAGCAGG 540 

AAGCAGTTAG AGCGGTCGTC GGCCAACCTC CCCAACAGCA CTTAGGTTTT CCTGTTGAGA 600 

TGGGGGACTG AGAGACAGGA CTAGCTGGAT TTCCTAGGCT GACTAAGAAT CCCTAAGCCT 660 

AGCTGG 6 ^6 



(2) INFORMATION FOR SEQ ID NO: 9: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 72 base pairs 
m (B) TYPE: nucleotide 

=P (C) STRANDEDNESS : single 

L (D) TOPOLOGY: linear 

ru io 

?Z (ii) MOLECULE TYPE: mRNA (as DNA) 

0 * 

^ (iii) HYPOTHETICAL: NO 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GACTTCCCAA ATACCAGACG AAGCAGAGTG GTTTACAGTC CTGGACCTTC AGGATGCCTT 60 

CTTCTGCATC CCTGTACATC CTGACTCTCA ATTCTTGTTT GCCTTTGAAG ATACTTCAAA 120 

CCCAGCATCT CAACTCACCT CGACTATTTT ACCCCAAGGG TTCAGGGATA GTCCCCATCT 10O 

ATTTGGCCAG GCATTAGCCC AAGACTTGAG CCAATCCTCA TACCTGGACA CTTGTCCTTC 240 
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GGTAGGTGGA TGATTTACTT TTCGCCGCCC 
AAGCGCTCTT CAATTTCCTC GCTACCTGTG 
TCTGCTCACA GCAGGTTACT TAGGGCTAAA 
GGAACACATC CACCCTATAC TGGCTTATCC 
ATTCCTTGGC GTAATAGGTT TCTGCCGAAA 
GTCATTAAAT ACACTAATTA AGGAAACTCA 
AACTGAAGTA GAAGTGGCTT TCCAGGCCCT 
AGGGCAAGAC TTTTGTTCAT ATGTCACAGA 
ACAGATCCGA CCGATGAGCT TGCAACCTGT 
GGCAAAGGGT TGACCTCATT GTTTACGGGT 
AGC AG TT AAA ATAATACAGG GAAGAGATCT 
CATACTCACT GCTAAAGGAG ACTTGTGGCT 
TCTATTACTT GAACGGCCAG TCCTGCGACT 
ATTTCTTCCA GACAATGAAG AAAAGATAAA 
CTATGCCACT CGAGGGGACC TTTTAGAGGT 
TACTGATGGA AGTTCCTTTG TAGAAAAAGG 
TGATAATGGA ATACTTGAAA GTAATCCCCT 
ACTAATAGCC CTCACTTGGG CACTAGAATT 



ATTCAGAAAC CTTGTGCCAT CAAGCCACCC 300 

GCTACATCGT TTCCAAACCA AAGGCTCAAC 360 

ATTATCCAAA GGCACCAGGG CCCTCAGTGA 420 

TCATCCCAAA ACCCTAAAGC AACTAAGGGC 480 

ATGGATTCCC AGGTATGGCC AA AT AG C C AG 540 

GAAAGCCAAT ACCCATTTAG TAAGATGGAC 6O0 

AACCCAAGCC CCAGTGTTAA GTTTGCCAAC 660 

AAAAACAGCA ATAGCTCTAG GAGTCCTTAC 720 

GGCACACCTG ACTAAGGAAA TTGATGTACT 780 

AGTGGTGGCA GTAGCAGTCT TAGTATCTGA 840 

TACTGTGTGG ACATCTCATG ATGTGAATGG 900 

GTCAGACAAC TGTTTACTTA AATGTCAGGC 960 

GTGCACTTGT GCAACTCTTA ACCCAGCCAC 1020 

ACATAACTGT CAACAAGTAA TTTCTCAAAC 1080 

TCCTTTGACT GATCCCGACC TCAACTTGTA 1140 

ACTTCGAAAA GTGGGCTATG CAGTGGTCAG 1200 

CACTCCAGGA ACTAGTGCTC AG C TAG C AG A 1260 
AGGAGAAGAA AAAAGGGCAA ATATAATACA 1320 
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GACTCTAAAT ATGCTTACCT ACTCCTCCAT 
TTCCTAACTT CTGAGAGAAC ACCTATCAAA 
GCTGTACAGA AACCTAGAGA GGTGGCAGTC 
GAAAGGGAAA TACAAGAGAA CTGCCAAGCA 
GACCCTCCAT TAGAAATGCT TATTAAACTT 
ACCAAGCCCC AGTACTCAGC AGCACAAACA 
CCCTCGGGAC GGTTAGCCAC TGAAGAAGGG 
AAATTACTTA AAACCCTTCA TCAAACCTTT 
GCCAAATCAT TATTTACTGC ACCAGGCCTT 
TGTGAAGTGT GCCAGAGAAA TAATCCCCTG 
AGAACAGGCC ATTACCCTGG AGAAGACTGG 
AGGGATTTCA GTATCTACTA GTCTGGGTAG 
CCTGTAGGAC AGAAAAGGCC CAAGAGGTAA 
GATTCGGACT TCCCCGAGGC TTACAGAGTG 
CCCAGGGAGT ATCCCAGGCG TTAGGTATAC 
AGTCCTCAGG GAAGGTCGAG AAAATGAATG 
CCCAGGAAAC CCACCTCACA TGGCCTGTTC 
ACTTTCCCCA AAAAGCAGGA CTTAGCCCAT 



GCCCATCCAG CAATATGGAA AGAAAGGGAA 1380 
CATCAGGAAG CCATTAGGAA ATTATTATTG 1440 
TTACACTGCC GGGGTCATCA CAAAGGAAAG 1500 
TATATTGAAG CCAAAAGAGC TGCAAGGCAG 1560 
CCCTTAGTAT AGGGTAATCC CTTCCGGGAA 1620 
GAATGGGGAA CCTCACGACC CAGTTTTCTC 1680 
AAAATACTTT TGCCTGCAAC TATCCAATGG 1740 
CACTTAGGCA TCGATAGCAC CCATCAGATG 1800 
TTCAAAACTA TCAAGCAGAT AGTCAGGGCC 1860 
CCTTATCGCC AAGCTCCTTC AGGAGAACAA 1920 
CAACTGATTT TACCCACAAG CCCAAACCTC 1980 
ATACTTTCAC GGGTTGGGCA GAG G CCTTCC 2040 
TAAAGGCACT AGTTCATGAA ATAATTCCCA 2100 
ACAATAGCCC TGCTTTCCAG GCCACAGTAA 2160 
GATATCACTT ACACTGCGCC TGAAGGCCAC 2220 
AAACACTCAA AGGACATCTA AAAAAGCAAA 2280 
TGTTGCCTAT AGCCTTAAAA AGAATCTGCA 2340 
ACGAAATGCT GTATGGAAGG CCCTTCATAA 2400 
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CCAATGACCT TGTCCTTGAC CCAAGACACC CAACTT AG TT GCAGACATCA CCTCCTTAGC 2460 
CAAATATCAA CAAGTTCTTA AAA C ATT AC A AGGAACCTAT CCCTGAGAAG ACGAAAAGAA 2520 
TATTCCACCC AAGTGACATG GTATTAGTCA AGTCCCTTCC CTCT AATTCC CCATCCCTAG 2580 
ATACATCCTG GGAAGGACCC TACCCAGTCA TTTTATCTAC CCCAACTGCG GTTAAAGTGG 2640 
CTGGAGTGGA GTCTTGGATA CATCACACTT GAGTCAAATC CTGGATACTG CCAAACGAAC 2700 
CTGAAAATCC AGGAGACAAC GCTAGCTATT CCTGTGAACC TCTAGAGGAT TTGCGCCTGC 2760 
TCTTCAAACA ACAACCAGGA GGAAAAATCG AAGCTGTAAA ACTACAAATG GAGCCCAAGA 2820 
TGCAGTCCAA GACTAAGATC TACCGCAGAC CCCTGGACCG G CCTGTT AG C CCACGATCTG 2880 
ATGTTAATGA CATCAAAGGC ACCCCTCCTG AGGAAATCTC AGCTGCACAA CCTCTACTAC 2940 
CCCCCAATTC AGCAGGAAGC AGTTAGAGCG GTCCTCGGCC AACCTCCCCA ACAGCACTTA 3000 
GGTTTTCCTG TTGAGATGGG GGACTGAGAG ACAGGACTAG CTGGATTTCC TAGGCTGATT 3060 
AAGAATCCCT AAGCCTAGCT GGGAAGGTGA CCACATCCAC CTTTAAACAC GGGG CTTGC A 3120 
ACTTAGCTCA CACCTGACCA ATC AG AG AG C TCACTAAAAT GCTAATTAGG CAAAGACAGG 3180 
AGCTAAAGAA ATAGCCAATC ATTTATTGCC TGAGAGCACA GCAGGAGGGA CAATGATCGG 3240 
GATATAAACC CAAGTTTTCG AGCCGGCAAC GGCAACCCCC TT TGGGTCCC CTCCCTTTGT 3300 
ATGGGAGCTC TG T T T TCATG CTATTTCACT CTATTAAATC TTGCAACTGC AAAAAAAAAA 3360 
AAAAAAAAAA AA 3372 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2372 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

REPLACEMENT SHEET (RULE 26) 
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(ii) MOLECULE TYPE: mRNA (as DNA) 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ACTGCACTCT TCTGGTCCAT GTTTCTTACG GCTCGAGCTG AG CTTTTG CT CACCGTCCAC 60 

CACTGCTGTT TGCCACCACC GCAGACCTGC CGCTGACTCC CATCCCTCTG GATCCTGCAG 120 

GGTGTCCGCT GTGCTCCTGA TCCAGCGAGG CCCCCATTGC CGCTCCCAAT TGGGCTAAAG 180 

GCTTGCCATT GTTCCTGCAC GGCTAAGTGC CTGGGTTTGT TCTAATTGAG CTGAACACTA 240 

ATCACTGGGT TCCATGGTTC TCTTCTGTGA CCCACGGCTT CTAATAGAAC TATAACACTT 300 

ACCACATGGC CCAAGATTCC ATTCCTTGGA ATCCGTGAGG CCAAGAACTC CAGGTCAGAG 360 

AATACGAGGC TTGCCACCAT CTTGGAAGCG GCCTGCTACC GTCTTGGAAG TGGTTCACCA 420 

CCATCTTGGG ACCTCTGTGA GCAAGGACCC CCCGGTAACA TTTTGGCAAC CAACGACGGA 480 

CATCCAAAGT GATGGGAAAC GTTCCCCGCA AGACAAAAAC GCCCCTAAGA CGTATTCTGG 540 

AGAATTGGGA CCAATTTGAC CCTCAGACAC T A AG AA AG AA ACGACTTATA TTCTTCTG CA 600 

GTGCCGCCTG GC A CTCCTG A GGGAAGTATA AATTATAACA CCATCTTACA GCTAGACCTC 660 
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TTTTGTAGAA AAGGCAAATC GAGTGAAGTG 
CACAACTCAC AATTATGTAA AAAGTGTGAT 
CCTCCCTATC CCAGCATCCC CGACTCCTTC 
AATGGTCCAA AAGGAGATAG A C AAA AGGGT 
CCAATTATGA CCCCTCCAAG CAGTGGGAGG 
GCCTTTTTCT CTCCCAGACT TAAAGCAAAT 
CCCTGATGGC TATATTCATC TTTTACAAGG 
AGATATAATC TCACTGCTAA ATCAGACACT 
TGCAGCCTGA GGGTTTGGCG TCTCTGCTAT 
CAGAAGGAAA GANAATGATT CCCCACAGGC 
TGGGACACAG AATCAGAACA TGGAGATTGG 
GAAGGACTAA GGAAAACTAG GAAGAAGTCT 
CAGGGAAGGG AACAAAATCC TACTGCCTTT 
CGTGCCTCTC TGTCACCTGA CTCTTCTGAA 
ACTCAGTGAG CTGCAGACAT TAGAAAAAAC 
ACTTAGAAAC CCTATTGAAC TTGGCAACCT 
AGGCGGAACA GGACAAACGG GATTAAAAAA 
CAAGTGGACT TTGCAGGCTC TGGAAAAGGG 



CCATAAGTAC AAACTTTCTT TTCATTAAGA 720 

TTATGCCCTA CAGGAAG CCT TCAGAGTCTA 780 

CCCAACTAAT AAGGACCCCC CTTCAACCCA 840 

AAA C AGTG AA CCAAAGAGTC CCAATATTCC 900 

AAGAGAATTC GGCCCAGCCA GAGTGCATGT 960 

AAAAACAGAC TTAGGTAAAT TCTCAGATAA 1020 

GTTAGGACAA TTCTTTGATC TGACATCGAG 1080 

AACCCCAAAT GAGAGAAGTG CCACCATAAC 1140 

CTCAGTCAGG TCAATGGATA NGGATGACAA 1200 

CAGCAGGCAG TTCCCAGTCT AGACCCTGAT 1260 

TGCTGCAGAC ATTTGCTAAC TTGTGTGCTA 1320 

ATGAATTACT CAATGATGTC CACCATAACA 1380 

CTGGAGAGAC TAAGGGAGGC ATTCAGGAAG 1440 

GGCCAACTAA TCTTAAAGCG TAAGTTTATC 1500 

TTCAAAAGTC TGCCGTAGGC CCGGAGCAAA 1560 

CGGTTTTTTA TAATAGAGAT CAGGAGGAGC 1620 

AAGGCCACCG CTTTAGTCAT GACCCTCAGG 1680 

AAAAGCTGGG CAAATTGAAT GCCTAATAGG 1740 
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GCTTGCTTCC AGTCCGCTCT ACAACGACAC TTTAAAAAAG ATTGTCCAAG TAGAAGTAAG 1800 
CCGCCCCTTC GTCCATGCCC CTTATTTCAA GCGAATCACT GGAAGGCCCA CTGCCCCAGG 1860 
GGACAAAGGT CTTTTGAGTC AG AAG CCACT AACCAGATGA TCCAGCACCA GGACTGAGGG 1920 
TGCCTGGCGC AAGCGCCATC CCATGCCATC ACCCTCACAG AGCCCTGGGT ATGCTTGACC 1980 
ATTCAGGGCC AGGAAGGTTG TCTCCTGG AC ACTGGTGCGG TCTTCTTAGT CTTACTCTTC 2040 
TGTCCCGGAC AACTGTCCTC CAGATCTCTC ACTATTCTGA GGGGGTCCNT AAGACGGGCA 2100 
GTCACTAGAT ACTTTTTCCC AG CCACT AAG TTATGAACTG GGGAGCTTTA TTCTTTTCAC 2160 
ATGCTTTTCT AATTATGCTT GAAAGCCCCA CTACCTTGTT AGGGAGAGAC ATTCTAGCAA 2220 
AAGCAGGGGC CATTATACAC CTGAACATAG GAGAAGGAAC ACCCGTTTGT TGTNCCCCTG 2280 
CTTGAGGAAG GAATTAATCC TGAAGTCTGC GCAACAGAAG GACAATATGG ACG AG CCAAA 2340 
GAATGCCCGT CCTGTTCAAG T T A AACT AAA GG 2372 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7582 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mRNA (as DNA) 

(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
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C AATTGGG CT 
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CATTG TN CCT 
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GGGTTCCATG 


G TT CTCTT CT 


GTGACCCACG 


G CTTCT AAT A 


KAACTATAAC 


ACTTACCACA 


420 
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TCTGGARAAT 
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XXXXXJlwX V— W 


PAfSAPTTAAA 


G C A A ATA AAA 


ACAGACTTAG 


GTAAATTCTC 


AG ATAAY CCT 


1140 


un x uuv« x n X n 


XXWJ^XUXX X X 


ACAAGGGTTA 


GGACAATTCT 


TTGATCTGAC 


ATGGAGAGAT 


1200 


AT AT ATG TC A 
i x n x • » x w x w»^» 


CTGCTAAATC 


AGACACTAAC 


CCCAAATGAG 


AGAAGTGCCA 


CCATAACTGC 


1260 


AGCCTGACRC 


TTTGG CG AT C 


TCTGGTATCT 


CAGTCAGGTC 


AATGGATANG 


GATGACAACA 


1320 


GAAGGAAACA 


NAATGATTCC 


CCACAGGCCA 


GCARGCAGTT 


CCCAGTCTAS 


ACCCTCATTG 


1380 


GGGACACAGA 


AATCAGTAAC 


ATGGGAGATT 


GGTGCTGCAG 


ACATTTGCTA 


ACTTGTGTGC 


1440 


TA&AAGGACT 


AAGGAAAACT 


ASGAAGAAAK 


TCTAYGAATT 


ACTCAATGAT 


GTCCACCATA 


1500 


A CAC AGG GG A 


AGGGAAGAAA 


ATCCTACTGC 


CTTTCTGGAG 


AG ACT AAGG G 


AGGCATTGAG 


1S60 


GAAGCGTGCC 


TCTCTGTCAC 


CTG A CTC TTC 


TGAAGGCCAA 


CTAATCTTAA 


AGCGTAAGTT 


1620 




A LAuU 1 VJ UAu 


APATTAflAAA 
/\w/\ X inunnn 


AAA CTTC AAA 


AGTCTGCCGT 


AGGCCCGGAG 


1680 


CAAAACTTAG 


AAACCCTATT 


GAACTTGGCA 


ACYTCGGTTT 


TTTATAATAG 


AGATCAGGAG 


1740 


GAGCAGGCGG 


AACAGGACAA 


ACGGGATTAA 


AAAAAAGGCC 


ACCGCTTTAG 


TCATGACCCT 


1800 


CAGGCAAGTG 


GACTTTGGAG 


GCTCTGGAAA 


AGGGAAAAGC 


TGGGCAAATT 


GAATGCCTAA 


1860 


T AGG G CTTG C 


TTCCAGTGCG 


GTCTACAAGG 


ACACTTTAAA 


AAAGATTGTC 


CAAGTAGAAG 


1920 


TAAGCCGCCC 


CTTCGTCCAT 


GCCCCTTATT 


T C AAGG G AAT 


CACTGGAAGG 


CCCACTGCCC 


1980 


CAGGGGACAA 


AGGTCTTTTG 


AGTCAGAAGC 


CACTAACCAG 


ATG ATCC AG C 


AGCAGGACTG 


2040 


AGGGTGCCTG 


GGGCAAGCGC 


CATCCCATGC 


CATCACCCTC 


ACAGAGCCCT 


GGGTATGCTT 


2100 
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G A CCATTG AG GGCCAGGAAG GTTGTCTCCT 

CTTCTGTCCC GGACAACTGT CCTCCAGATC 

GGCAGTGACT AGATACTTTY TCCCAGCCAC 

TCACATGCTT TTCTAATTAT GCTTGAAAGC 

GCAAAAGCAG GGGCCATTAT ACACCTGAAC 

CCTGCTTGAG GAAGGAATTA ATCCTGAAGT 

CAAAGAATGC CCGTCCTGTT CAAGTTAAAC 

GGCAGTACCC CCTCAGACCC AAGGCCCAAC 

AAGCCCAAGG CTTAGTAAAA CCATGCATAA 

GAGGCACAGA AACCC AG TGG ACAGTGGAGG 

AGGCCGTTGT CCTTTTATAC CCAGCTGTAC 

CAGAGGAAGC AGAGTGGTTT ACASTCCTGG 

TACATCCTGA CTCTCAATTC TTGTTTGCCT 

TCACCTGGAC TRTTTT A CCC CAAGGGTTCA 

TAG CCC AAG A CTTGAGYCAR TYMTCATACC 

ATTTACTTTT RGCYGCCYRT TCAGAAACCT 

ATTTCCTCGC YACCTGTGGC TACAWGGTTT 

AGGTTAAATA CTTAGGRCTA ARATTATCCA 

TCCAGCCTAT ACTGGCTTAT CCTCATCYCA 

G CRT AA Y AG G YTTCTGCCGA AWATGGATTC 

WATACASTAA TTAAGGAAAC TCAGAAAGCC 

GYMRAAGTGG CTTTCCAGGC C C CT AA AG AA 

TGCCAACRGG GCAAGACTTT TSTTYATAYR 

GTCCTTACAC AGRTCCRAGG GAY G AGCTTG 

GATGTAGTGG C AAAGGG TTG RCYTCATTGT 

GTATCTGAAG CAGTTAAAAT AATACAGGGR 

GTGAAYRGCA TACTCACTGC TAAAGGAGAC 

TRTCAGGCTC TATTACTTGA ARGGCCAGTG 

CCAGYCNCAT TTCTTCCAGA CAATGAAGAA 

TCTCAAACCT ATGCCACTCG AGGGGACCTT 

CAACTTGTAT ACTGATGGAA GTTCCTTTGT 

AG TGG TC AG T GATAATGGAA T A Y TTG A AAG 

G CTRGCAG AA CTAATAGCCY TCAYTKGGGC 

TATATATACA GACTCTRART ATGCTYACCT 

AGAAAGGGAA TTCCTAACTT CYGAGRGAAC 

ATTATTAYTG GCWGTACAGA AACCTARAGA 



GGACACTGGT 


GCGGTCTTCT 


TAGTCTTACT 


2160 


TGTCACTATT 


CTGAGGGGGT 


CCNTAAGACG 


2220 


TAAGTTATGA 


ACTGGGGAGC 


TTTATTCTTT 


2280 


CCCACTACCT 


TGTTAGGGAC 


AGACATTCTA 


2340 


ATAGGAGAAG 


GAACACCCGT 


TTGTTGTNCC 


2400 


CTGGG CAACA 


GAAGGACAAT 


ATGGACGAGC 


2460 


TAAAGGATTC 


CACTTCCTTT 


CCCTACCAAA 


2520 


AAGGATTCCA 


AAAGATTGTT 


AAGGACTTAA 


2S80 


CTCCCTGCAG 


TAATTCCGTA 


GTGGATTGAG 


2640 


GTTAGTGCAA 


GATCTCAGGA 


TTATCAATGG 


2700 


CTAGCCCTTA 


TACTGTGMYT 


TCCCAAATAC 


2760 


ACCTTMAGGA 


TGCCTTCTTC 


TGCATCCCTG 


2820 


TTGAAGATAC 


TTCAAACCCA 


RCATCTCAAC 


2880 


GGGATAGYCC 


CCATCTATTT 


GG CCAGG CAT 


2940 


TGGACACTCT 


TGTCCTTCRG 


TAKGTGGATG 


3000 


TGTGCCATCA 


AGCCACCCAA 


GCRCTCTTMA 


3060 


CCAAACSARA 


RGCTCARCTC 


TGCTCACAGC 


3120 


AAGGCACCAR 


GGCCCTCAGT 


GAGGAAYRYA 


3180 


AAACCCTAAA 


GCAACTAAGR 


GRRTTCCTTG 


3240 


CCCAGGTWTG 


GCRAAATAGC 


CAGG Y C ATT A 


3300 


AATACCCATT 


TARTAAGATG 


GAYAMCTGAA 


3360 


GGCCTTAAAC 


CCAAGYCCCA 


GTGTTAAGYT 


3420 


TCACAGAAAA 


AAACAGRAAY 


AGCTCTRGGA 


3480 


CAACCYRTCG 


CRYACCTGAS 


TAAGGAAAYT 


3540 


TTAYGGGTAG 


TGGTGGCAGT 


AGCAGTYKTA 


3600 


AGAGATCTTA 


CTGTGTGGAC 


ATCTCATGAK 


3660 


TTGTGGCTGT 


CAGACAACYG 


TTTACTTAAA 


3720 


CTGCRACTGT 


GCACTTGTGC 


AACTCTTAAC 


3780 


AAGATARAAY 


ATAACTGTCA 


ACAARTAATT 


3840 


YTAGARGTTC 


CYTTGACTGA 


TCCYGACCTT 


3900 


AGAAAAAGGA 


CTTCGAAAAG 


YGGGGTATGC 


3960 


TAATCCCCTC 


ACTCCAGGAA 


CTAGTGCTYA 


4020 


ACTAGAATTA 


GGAGAAGRAA 


AAAGGG Y AAA 


4080 


AGTCNTCCAT 


GCCCATGMRG 


CAATATGSAR 


4140 


ACCTATCAMA 


CATCAGGAAG 


CCATTAGGAR 


4200 


GGTGGMAGTC 


TTACACTGCY 


GGGGTCATCA 


4260 



REPLACEMENT 
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NAAAGGAAAG 


RAAAGGGAAA 


TA5AAGRGAA 


YTGCCAAGCA 


KATATTGAAG 


CHAAAAGAGC 


4320 


TGCAAGGCAG 


GACCCTCCAT 


TAGAAATGCT 


TATTAAACTT 


CCCTTAGTAT 


AGGGTAATCC 


4380 


CTTCCGGGAA 


ACCAAGCCCC 


AGTACTCAGC 


AGGAGAAACA 


GAATGGGGAA 


CCTCACGAGG 


4440 


CAGTTTTCTC 


CCCTCGGGAC 


GGTTAGCCAC 


TGAAGAAGGG 


AAAATACTTT 


TGCCTGCAAC 


4500 


TATCCAATGG 


AAATTACTTA 


AAACCCTTCA 


TCAAACCTTT 


CACTTAGGCA 


TCGATAGCAC 


4560 


CCATCARATG 


GCCAAATCAT 


TATTTACTGG 


ACCAGGCCTT 


TTCAAAACTA 


TC AAG CARAT 


4620 


AKTCAGGGCC 


TCTGAAKTGT 


G CC ARARAAA 


TAATCCCCTG 


CCTYATCGCC 


AAGCTCCTTC 


4680 


ACCARAACAA 


ARAACAGGCC 


ATTACCCTCR 


ARAARACTGG 


CAACTGATTT 


TACCCACAAG 


4740 


CCCAAACCTC 


AGGGATTTCA 


G T A TCT A CT A 


GTCTGGCTAR 


ATACTTTCAC 


GGGTTGCGCA 


480O 


RAGGCCTTCC 


CCTGTAGGAC 


AGAAAAGGCC 


CAAGAGGTAA 


TAAAGGCACT 


AGTTCATGAA 


4860 


ATAATTCCCA 


GATTCGGACT 


TCCCCGAGGC 


TTACAGAGTG 


ACAATAGCCC 


TGCTTTCCAG 


4920 


GCCACAGTAA 


CCCAGGGAGT 


ATCCCAGGCG 


TTAGGTATAC 


GATATCACTT 


ACACTGCGCC 


4980 


TGAAGGCCAC 


AGTCCTCAGG 


GAAGGTCGAG 


AAAATGAATG 


AAAYACTCAA 


AGGACATCTA 


5040 


AAAAAGCAAA 


CCCAGGAAAC 


CCACCTCACA 


TGGCCTGYTC 


TGTTGCCTAT 


AGCCTTAAAA 


5100 


AGAATCTGCA 


ACTTTCCCCA 


AAAAG CAGG A 


CTTAGCCCAT 


ACGAAATGCT 


GTATGGAAGG 


S160 


CCCTTCATAA 


CCAATGACCT 


TGTGCTTGAC 


CC AAG AC AGC 


CAACTTAGTT 


GCAGACATCA 


5220 


CCTCCTTAGC 


CAAATATCAA 


C AAG TTCTT A 


AAA C ATT AC A 


AGGAACCTAT 


CCCTGAGAAG 


5280 


AGGGAAAAGA 


ACTATTCCAC 


CCWWGTGACA 


TGGTATTAGT 


CAAGTCCCTT 


CYCTCTAATT 


5340 


CCCCATCCCT 


AGATACATCC 


TGGGAAGCAC 


CCTACCCAGT 


CATTTTATYT 


ACCCCAACTG 


5400 


CGGTTAAAGT 


GGCTGGAGTG 


GAGTCTTGGA 


T AC AT C AC AC 


TTGAGTCAAA 


TCCTGGATAC 


5460 


TGCCAAAGGA 


ACCTGAAAAT 


CCAGGAGACA 


ACGCTAGCTA 


TTCCTGTGAA 


CCTCTAGAGG 


5520 


ATTTG CG CCT 


GCTCTTCAAA 


CAACAACCAG 


GAGGAAAGTA 


ACTAAAATCA 


TAAATCCCCC 


5580 


ATGGSCCTCC 


CTTATCATAT 


TTTTCTCTKT 


ASTGTTSTTT 


YACCCTSTTT 


CACTCTCACT 


5640 


GCACCCCCTC 


CATGCCGCTG 


TATGACCAGT 


AGCTCCCCTY 


ACCMAGAGTT 


TCTATGGAGA 


5700 


ATGCAGCGTC 


CCGGAAATAT 


TG ATG CCCCA 


TCGTATAGGAG TCTTTSTAAG GGAACCCCC 


5760 


ACCTTCACTG 


CCCACACCCA 


TATGCCCCGC 


AACTC CT ATC 


ACTCTGCCAC 


TCTTTGCATG 


5820 


CATGCAAATA 


CTCATTATTG 


GACAGGAAAA 


ATGATTAATC 


CTAGTTGTCC 


TGGAGGACTT 


5880 


GGAGTCACTG 


TCTGTTGGAC 


TTACTTCACC 


C AAACTGG T A 


TGTCTGATGG 


GGGTGGAGTT 


5940 


CAACATCAGG 


CAAGAGAAAA 


ACATG T AAA A 


GAAGTAATCT 


CCCAACTCAC 


CSGGGTACAT 


6000 


GGCACCTCTA 


GCCCCTACAA 


AGGACTAGAT 


CTCTCAAAAC 


TACATGAAAC 


CCTCCGTACC 


6060 


CATACTCGCC 


TGGTAAGCCT 


ATTTAATACC 


ACCCTCACTG 


GGCTCCATGA 


GGTCTCGGCC 


6120 


CAAAACCCTA 


CTAACTGTTG 


GATATGCCTC 


CCCCTGAACT 


TCARGCCATA 


TGTTTCAATC 


6180 


CCTGTACCTG 


AACAATGGAA 


CAACTTCAGC 


ACAGAAATAA 


ACACCACTTC 


CGTTTTAGTA 


6240 


GGACCTCTTG 


TTTCCAATST 


CGAAATAACC 


CATACCTCAA 


ACCTCACCTG 


TGTAAAATTT 


6300 


AGCAATACTA 


CATACACAAC 


CAACTCCCAA 


TGCATCAGGT 


GGGTAACTCC 


TCCCACACAA 


6360 


ATAGTCTGCC 


TACCCTCAGG 


AATATTTTTT 


GTCTGTGGTA 


CCTCAGCCTA 


TCGTTGTTTG 


6420 



REPLACEMENT 
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AATGG CTCTT 


CAGAATCTAT 


GTGCTTCCTC 


TCATTCTTAG 


TGCCCCCYAT 


GRCCATCTAC 


6460 


ACTGAACAAG 


ATTTATACAG 


TTATGTCATA 


TCTAAGCCCC 


GCAACAAAAG 


AGTACCCATT 


6540 


CTTCCTTTTG 


TTATAGGAGC 


AGGACTGCTA 


GGTGCACTAG 


GTACTGGCAT 


TGCCGGTATC 


6600 


ACAACCTCTA 


CTCAGTTCTA 


CT ACAAA CT A 


TCTCAAGAAC 


TAAATGGGGA 


CATGGAACGG 


6660 


GTCGCCGACT 


CCCTGGTCAC 


CTTGCAAGAT 


CAACTTAACT 


CCCTAGCAGC 


AGTAGTCCTT 


6720 


CRAAATCGAA 


G AGCTTTAG A 


CTYGCTAACC 


GCTGARAGAG 


GGGGAACCTG 


TTTATTTTTA 


6780 


GGGGAAGAAT 


G CTCTTATTA 


TCTTAATCAA 


TCCGGAATCC 


TCACTGAGAA 


AGTTRAAGAA 


6840 


ATTCSAGATC 


GAATACAACG 


T AKAG C AG A R 


GAGCTTCGAA 


ACACTGGACC 


CTGGGCCCTC 


6900 


CTCAGCCRAT 


GGATGCCCTG 


GATTCTCCCC 


TTCTTAGGAC 


CTCTAGCAGC 


TATAATATTG 


6960 


CTACTCCTCT 


TTGGACCCTG 


TATCTTTRAC 


CTCCTTGTTA 


ACTTTGTCTC 


TTCCAGAATC 


7020 


GAAGCTGTRA 


AACTACAAAT 


GGAGCCCAAG 


ATGCAGTCCA 


AGACTAAGAT 


CTACCGCAGA 


7080 


CCCCTGGACC 


GGCCTGYTAG 


CCCACGATCT 


GATGTTAATG 


ACATCAAAGG 


CACCCCTCCT 


7140 


GAGGAAATCT 


CAGCTGCACA 


ACCTCTACTA 


CGCCCCAATT 


CAGCAGGAAG 


CAGTTAGAGC 


7200 


GGTSGTCGGC 


CAACCTCCCC 


AACAGCACTT 


AGGTTTTCCT 


GTTGAGATGG 


GGGACTGAGA 


7260 


CACAGGACTA 


GCTGGATTTC 


CTAGGCTCAY 


TAAGAATCCY 


TAAGCCTAGS 


TGGGAAGGTG 


7320 


ACCACATCCA 


CCTTTAAACA 


CGGGGCTTGC 


AACTTAGYTC 


ACACCTGACC 


AATCAGAGAG 


7380 


CTCACTAAAA 


TG CTAATTAG 


GCAAAGACAG 


GAGGTAAAGA 


AATAGCCAAT 


CATYTATTGC 


7440 


KTGAGAGCAC 


AGCAGGAGGG 


ACAATGATCG 


GGATATAAAC 


CCAAGTYTTC 


GAGCCGGCAA 


7500 


CGGCAACCCC 


CTTTGGGTCC 


CCTCCCTTTG 


T ATGGG AC CT 


CTGTTTTCAT 


GCTATTTCAC 


7560 


TCTATTAAAT 


CTTGCARCTG 


CR 








7582 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2563 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

ACTGCACTCT TCTGGTCCAT GTTTG TTACG GCTCGAGCTG AGCTTTTGCT CGCCATCCAC 60 

CACTGCTGTT TGCCACCGTT GCAGACCCAC TGCTGACTTC CATCCCTCTG G ATCTGG C AG 120 

GGTGTCTGCT GTGCTCCTGA TCCAGCGAGG GGCCCATTGC CACTCCCAAT CGGGCTAAAC 180 

GCTTGCCATT GTTCCTGCAT GGCTAAGTGC CCAGGTTCAT CCTAATTGAG CTGAACACTA 240 

GTCACTGGGT TCCACAGTTC TCTTCCATGA ACCACGGCTT TTAATAGAGC TATAACACTC 300 

ATCCCAAGGC CCAAGATTCC ATTCCTTGGA ATCTGTGAGC CCAACAACCC TAGGTCAGAG 360 

AACACGAGGC TTGCCACCAT CTTGGAAGCA GCCTGCCACC ATCTGGGAAG CGGCCTGCCA 420 

CCATCTTGGA AGCCGCCCGC CACCATCTTG GGAGCTCTGG GAGCAAGGAC CTCCCCGCAA 480 

CCCAGTAACA TTTAGCGACC ACGAAGGGAC CTCCAAAGCG GTAATATTGG ACCACTTTCA 540 

CTTGCTATTC TGTCCTATCC TTCCTTAGAA TTGGAGGAAA ATACCGGACA CCTGTCGGCC 600 

GGTTAAAAAC GATTAGCGTG GCCTCCGGAC TTAAGAATCA GGTGTGAGGC T ATCTGG GG A 660 

AGG CCTTTCT AACAACCCCC AACCRTTCTG GGTTGGGAAT GTTGGTCTGC CTGGAG CC AG 720 

CTTCCACTTT CAATTTTCCT GGGGAAGCCA AGGG CCG ACT AGAGGCAGAA AGCTGTTGTC 780 

CCAAATTCCC GGCAGTAGCC GGTTGAGATC ATGGCGCAGC CAGAAGTCTT TACTCCACAG 840 

TCACCCATGC ATGOGCCCCT ATCTTTCCTT CTGACCCATA CCTCCTGGGT CCTAACCATG 900 

ACTTTCTTAA AAGGGTAGCC CCAAAATTCT CCTTACCTCT GAATCTACTT CCTCTGATCC 960 

CTGCCTCCTA GGTGCTAATG GTTCACACTT TCATTTCCTC TAGCAAGTTG TATYTCCAAA 1020 

GGGATATAAG GAAGCTCTAC ACTGTATCCT TAGGCATCTA GGCTCTAAAC CCAGGGAGTC 1080 
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TTGTCCCTGA TCTCCCAACC GATTTAGGTA 
ACCCATTCCC CACCACCCTT GCCAGGGCCC 
GAGAGAGAGA GAG AC AG AG T GAGACACAGA 
GGAGAGAGAC ACAGAGAGGG GAGAGACACA 
GGAGTCYMAG AGAGAGAGAA AGAAGAAGAA 
TTAAAAGCCA GGGTAAATTT AAAAAACCTA 
CCTGTAACAC TCTAATACTA CCTTGTTCTC 
CTGAGACCGC TGACACCCAT AGCTTTCCTA 
TGGCCCGCAT GCATTCAATC TGTAGTGGCA 
AG T AACTTTT AGAGGAAACC TCATTCTGAG 
TCAAAAAAGC AAA AAGG T A G CTTACTAACT 
TAGAAAAAGG TAATTTAACA CTAATCACTG 
CAGGAGATTT AAATCTTAAT TACCATACAA 
TCAGTACAGG ATGATAGATG GTTCCTCCCA 
TCAGTAATTG ATAGGGAGAC TCTTGTGGAA 
TCTCCCCAAA CCTGCGAGCT GTTTG CACTC 
AAAGATTATC TCAATCCTGA CTCAAAAGGT 
TAAGAACTGT TTATGGGACT G CATCTTG AT 



TATAGTTCTC CACATCCGCA GTTATGTCGC 1140 
CAAGTTTGTA AATGG CTAAG AGAGGAAAGT 1200 
GAGAGGGAGA GACAGAGAGA GAGAGAGAGA 1260 
GAGAGGAGAA GGGGGCAGAG AGACCAAGAG 1320 
ATAGTAGAAA AAAAAGTGTG CCCTATTCCT 1380 
TACTTGATAA TTCAAGGTCT TCTCCATGAC 1440 
AGTGTAAACA AGGGTGTTAG CCTG A AAA C A 1500 
TAAAAAATCC TTAACCCAGT AACCCGCAGA 1560 
ACTGCTTTGC TAACAAGAAT AAAGTGGAAA 1620 
CACACCTCAC CAGTTCAGAA TTATTCTAAG 1680 
CAAAAATCTT AAAGTATGGG GTTATTTTGT 1740 
ATAATTCCCT TAACCCAGAA GATTTCCTAA 1800 
AGGTCTGACC AGACCTAGGA GGAACTCCCT 1860 
GGTGAATGAA AAAAAAATCA CAATGGGTAT 1920 
GCAGAGTTAG AAAAACTGCC TAATAATTGG 1980 
AGCCAAGCCT TAAAGTACTT CTAGAATCAA 2040 
TACCTACACC CTCTGTGAAA CGAATTTACT 2100 
GGGGCAGCTG GGTTGTCATG AAATACT C AG 2160 
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GAATGCAGCC TAGCTCTAGG ACTCACCCCT GAGCACAAAG GCAATGTTGG GCATGCTGGT 2220 
AAAGGACCAC TAGAATCCAG CAGTCCGAAC CCTTTCTTTG GGTTAAGAAA GGCGGGAAAA 2280 
CAGGCGCAGG ACTGCTACAT TGGTAAGCGT AACTAATCCA ATAAGCAGAG GTCCATGGGT 2340 
GCTGACACAC TCTGGAAAGG AATAAGCATT AGRACCATAG AGGACGCTCT ACGACTAATC 2400 
CTCGTCGGAA AATGACTAGA GGTGCTGGCA TCCCTATGTT C TTTTTTC AG ATGGGAAATG 2460 
TTCCCCCTCA AGGCAAAAAC ACCCCTAAGA TGTATTCTGG AC AATTGGG A CCAATTTGAC 2520 
CCTCAGACTC TAAGAAAGAA ACGACTTATA TTCTTCTGCA GTG 2563 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 585 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

TCAGGGATAG CCCCCATCTA TTTGGCCAGG TATTAGCCCA AGACTTGAGC CACTTCTCAT 60 

ACTTGGACAC TCTTGTCCTT TGGTATGTGG ATGATCTACT TTTAGCCACC TG TT C AG AAA 120 

CCTTGTGCCA TCAAGCCAAC CAAGTGCTCT TAAACTTCCT CGCCACCTGT GGCTACAAGG 180 

TTTCCAAACC AGAGGCTCAG CTCTGCTTAC AGCAGGTTAA ATACTTAGGG CTAAAATTAT 240 
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CCAAAGG C AC CAGGGCCCTC AGTGAGGAAC 
CCAAAACCCT GAAGCAATTA AGAGGGTTCC 
TTCCCAGGTA CAATGAAATA GCCAGCCCAT 
CCAATACCCA TTTAGTAGAA TGGACACCTG 
AGGCCCTAAT CCAAGCCCCA GTGTTAAGCT 
TCACAGAAAA AAAAACAGGA ATAGCTCTAG 
TACAACACAT GGCATACCTG AGTAAGGAAA 
GTTTACAGGT AGTGGCAGCA GTAGCAGTCT 
GAAGANATCT TACTGTGTGG ACATCTCATG 
ACTGTGG CTG TCAGACAACC ATTTGCTTAA 
GCTGCCACTG TGCACTTGTG CAACTCTTAA 
AAAGATAGAA CATAACTGTC AACAAGTGAT 
TCTAGAGGTT CCCTTGACTG ATCCTGAGCT 
TAGAAAAAGG ACTTCGAAAG GCGGG TATGC 
TAATCCCTTC ACTCCAGAAA CTAGCATTCA 
ATTAGAACAC AGGAGAAGGA AAAGGAGTAA 
TAGTCCTCCA TGCCCATGCA GCAATATAGA 
CACCTATCAA ACATCAGGAA GCCATTAGGA 



GTATCCAGCC TATACTGGCT TATCCTCATC 300 

TTGG CAT AAA AG G CTG CTG T TGAATATGGA 360 

TATACACACT AATTACCGGA ACTCAGAAAG 420 

AAGCAGAAGC GGCTTTCCAG GCCCTAAAGA 480 

TGCCAATGGA GCAAGACTTT TCTTTATATG 540 

AAGTCCTTAC ACAGGTCCGA GGGACCAGCT 600 

CTGATGTAGT GGCAAAGGGT TGGACTCATT 660 

TAG C ATCTG A AGCAGTTAAA ATGATACAGG 720 

ATGTGAACGG CATACTCACT GCTAAAGGAG 780 

ATATCAGGCT CTATCACTTG AANGGCCAGT 840 

CCCACCCACA TTTCTTCCAG ACAATGAAGA 900 

TGTTCAAACC TACACCGCTC GAAGGGACCT 960 

CAACTTCTAT ACTGATCGAA GTTCCTTTTG 1020 

AGTGGCCAGT GATAATGGAA TACTTGAAAG 1080 

GCTGGCAGAA TTAATAG CCT TCACTTGGGC 1140 

ATATATATAC AGACTCCAAG TATGCTTACT 1200 

GAGAAAGCGA ATTCCTAACT TCTGAGGGAA 1260 

GATTATTACT GGCTGTACAG AAACCTAGAG 1320 



REPLACEMENT 
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GTGGCAGTCT TACATGGCCG AGATCATCAG 
TGCCAAGTGG ATATTGAAGC CAAAAGAGCT 
ATAGAAGGAC CCCTAGTACA GGGCAATCCC 
GAAGAAATGG AATGGGGAAC CTCATGAGGA 
ACCAAAGAAG CAAAAATACT TTTGCCTGCA 
CACCAAACCT TTCGCTTAGG CATTGATAGC 
AGACCACACC TTTTCAAAAC TATCAAGCAG 
AATAATCCCC TGCCTTATCG CCAAACTCCT 
GGAGAAGAGT CGCAACTAGA TTTTACCCAC 
CTAGTCTGGG TAGATACTTT CACTGGTTGG 
GCCCATGAGG TAATAAAGGC ACTAATTCAT 
GGCTTACAGA GTGATAACGG CCCCACTTTC 
ACATTAGACA TACAATATCA CTTACACTGA 
AGAAAATGAA TCAAACGCTC AAATGACATC 
CATGGTTTGC TCTGTTGCCT ATAGCCTTAG 
GACTCAG CCC ATACGAAATG CTGTATGGAC 
ACCTAGAGAT GGCCAACTTA GTTGCAGATA 
TAAAACGTCA CAGGGAACCT GTCCCTGAGA 



33 - 

AAAGGAAAAG AAAGGGAAAT AGAAGGGAAC 1380 
GCAAGGCGGG ACCCTCCATT AGAAATGCTT 1440 
CTTCAGGAAA CCAAGCCCCA ATACTCAGCA 1500 
CATAGTTTCC TCCCCTCAGG ATGGCTAGCC 1560 
G CT A A CCA AT GGAAATTACT TAAAACCCTT 1620 
ACCCATCAGA TGGCTAAATC ATTATTTACT 1680 
ACAGTTAGGG CCTGTGAAGT GTGCCAAAGA 1740 
TCAGGAGAAA AAAGAACAGG CCATTACCCA 1800 
ATGCCCAAAT CTCAGGGATT TCAGTATCTA 1860 
GCGGAGGCCT TCCCTTGTAG GACAGAACAG 19 20 
GAAATAATTC CCAGATTTGG ATTTCCCCAA 1980 
AAGGCTACAG TAACCCAGGG AGTATCCCAG 2040 
GCCCGGAGGC CACAATCCTC AGGAAAGTTG 2100 
TAAAAAACCT AACCTAAGAA ACCCACCTCT 2160 
TAAGAATCCG AAACTCTCCC CAAAAAGCGG 2220 
GGCCCTTCCT AACCAATGAC CTTGTGCTTG 2280 
TCCCTCCTTA GCCAAATATC AACAAGTTCT 2340 
GGAGGGAAAG GAATTATTCC AACCTGGTGA 2400 
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CATGGTATTA CTGAAGTCCC TTCCCTCCAA 
GACCCTACTC AGTCATTTTA TCTATCCCAA 
GGATACATCA CATTCGAGTC AAACCCTAGA 
GACAA 



34 - 

CTCCCCATCC CCTGGATACA TCCTCGGAAG 2460 
CCGCGGTTAA AATGGCTGGA GTAGAATCTT 2520 
TACTGCCACA AGGAACCTGA AAATCCACGA 2580 

2585 



(2) INFORMATION FOR SEQ ID NO: 14: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 57 5 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
yg (D) TOPOLOGY: linear 

* 10 

jfj (ii) MOLECULE TYPE: DNA 

1=3 

jjj (iii) HYPOTHETICAL: NO 

^ 15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 



ry 



GGGATAGCCC CCATCTATTT GGCCAGGCAT TACCCCAAGA CTTCAACCCA ATTCTCATAC 60 

CTGGACACTC TTCT CCTTTG GTATGTGGAT GATTTACTTT TAGCTTCCTG TTCAGAAACC 120 

TTGTG CCATC AAGCCACCCA AGCACTCTTA AATTTCCTCG CTACCTGTGG CTACAAGGTT 180 

TCCAAACCAA AGACCCAGCT CTGCTCACAG CAGGTTAAAT ACTTGGGGCT AAAATT AT CC 240 

AAACCCACCA CGGCCCTCAG TGAGGAACGT ATCAAGCCTA TACTGGCTTA TCCTCATCCC 300 

CAAATCCTAA AGCAACTAAG AGAGTTCCTT AGCATAACAG GTTTCTGCTG AATATGGATT 360 
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CCCAGGTATG CCAAAATAGC CAGACCATTA 
AATACCCATT TAGTAAGATG GATACCTGAA 
GCCCTAACCC AAG CCCCAGT GTTAAGCTTG 
ACAGAAAAAA CAGGAAATAG CTCTAGGAGT 
ACCCATGGCA T ACCTG AG T A AGGAAATTGA 
ATGGGTAGTG GCGGCAGTAG CAGTCTTAGC 
AGATCTTACT GTGTGGACAT CTCATGATGT 
GTGGCTGTCA GACAACCATT TACTTAAATA 
GCAACTGCGC AGTTGTGCAG CTCTTAACCC 
AACATAACTG CCAACAAGTA ATTT CTC AAA 
TTCCCTTAAC TGATCCCGAC CTCAACTTGT 
GACTTTGAAA AGTGGGGTAT GCAGTGCTCA 
TCATTCCAGG AACCAGCGTT CAGCTGGCAG 
TAGGAGAAGG AAAAAGGGTA AATACACATA 
GTGCCCACGC AGCAATATGG AG AG AAAGGG 
AACATCAGGA AGTTATTAGG AGATTATTAT 
TCTTACACTG CTGGGGTGGT C AG AAAG AAA 
CGGATATTGA AGCCAAAAGA CCCGCAAGCC 



TATACGCTAA TTAAGGAAAC TCAGAAAGCC 420 
GCAGAAGCAG CTTTCCAGGC CCTAAAGAGG 480 
CCAACAGGGC AAGACTTTAC TTCCTATCTC 540 
CCTTACACAA GTCTGAGGGA TGACCTTGCA 600 
TCTAGTGGCA AAGGGTTGGC CTCATTGTTT 660 
ATCTG A AG CA GTTAAAATGA TACAGGGAAG 720 
GAATGGCATA CTCACTGCTA AAGGAGACTT 780 
TCAGGCTGTA TTACTTGAAC GGCCAGTGCA 840 
AG CCA C ATTT CTTCCAGACA A TG AAG AT AC 900 
CCTAGGCCGC TCCAGGGAAC CTTTTAGAGG 960 
ATACTGATGG AAGTTCCTTT GTAGAAAAAG 1020 
GTGATAATGG AATACTTGAA AATAATCCCT 1080 
AATTAATAGC CCTCACTCGG GCATTAGAAT 1140 
CAGATTCTAA GTATGTTTAC TTAGTCCTCC 1200 
AATGCTTAAC TTCTGAGGGA ACACCTATCA 1260 
TGGCTATACA GAAACCTAAA GAGGTGGCAG 1320 
AGG AAAGGG A AATAAAAGGG AACTGCCAAG 1380 
AGGACCCTCC ATTACAAATG CTTATAGAAG 1440 
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GACCCCTAGT ATGGGGTAAT CCCCTCCGGG 
TAGAATGGGG AACCTCACCA GGACATAGTT 
AAGGAAAAAT ACTTTTCCCT GCAGCTAACC 
CCTTTCACTT AGACATTGAT AGCACCCATC 
GCCTTTTCAA AACTATCAAG CAGCTAGTCA 
CCATGCCTTA TCACCAAGCT CCTTCAGGAG 
RVTGGCAACT AGATTTTACC CACATGCCCA 
GGGTAGATAC TTTCACTGGT TGGGCAGAGA 
AGGTAATAAA GGCATTAGTT CATGAAATAA 
AGAGTGACAA TGGCCCTGCT TTCAAGGCTA 
TATACAATAT CACTTACACT GCGCCTGGAG 
TGAAACACTC AAACGACATC TAAAAAAAGC 
CTCTGTTGCC TATAGCCTTA CTAAGAATCC 
CATACGAAAT GCTATATGGA TAGCCCTTCC 
GAGCCAACTT AGTTGCAGAC ATCACCTCCT 
TACAAGGAGC CTGTCCCCGA GAAGAGGGGA 
TAGTCAAGTC CCTTCCCTCT AATTCTCATT 
CAGTCATTTT ATCTACCCCA ACCGCAGTAA 

ACACTCGAGT CAAACCCTGG ATATTACCAA 



AAACCAAGCC CCAATACTTA GAAAAAGAAA 1500 
TCCTCCCCTC AGGATGGCTA GCCACCGAAG 1560 
AATGGAAATT ACTTAAAACC CTTCACCAAA 1620 
AGATGGCCAA ATCATTATTT ACTGGACCAG 1680 
GGGCCTGTGA AGTGTGCCGA AGAAATAATC 1740 
AACAAAGAAC AGGCCATTAC CCAGCAGAAG 1800 
AATCTCAGGG ATTTCAGTAT CT ACT AG TTT 1860 
CCTTCCCCTG TAAGACAGAA AAGTCCCAAG 1920 
TTCCCAGATT CAGACTTCCC TGAGGCTTAC 1980 
CAGTAACCCA GGAGTATCCC ACGTGTTAGG 2040 
GCAGTCCTCA GGGAAGGCCG ACAAACTGAA 2100 
TAACCCAGGA AAACCACCTC ACATGGCCTG 2160 
AAAACTCTCC CC A AAA AG C A GGACTTAGCC 2220 
TAACCAATGA CCTTGTGCTT GACTGAGAGA 2280 
TATCCAAATA TCAACAAGTT CTTAAAACAT 2340 
AGGAACTATT CCACCCTGGT GACATGGTAT 2400 
G CCT AG AT AT ATCCTGGGAA GGACCCTACC 2460 
AAGTGGCTGG AGTGGAGTCT TGGATACATC 2520 

AGGAACCTGA AAATCCAGGA CACAA 2575 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 783 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

J3 TGAGAGACAG GACTAGCTGG ATTTCCTAGG CYGACTAAGA ATCCYTAAGC CTAGSTGGGA 60 

11 AGGTGACCAC RTCCACCTTT AAACACGGGG CTTGCAACTT AGYTCACACC TGACCAATCA 120 

^ GAGAGCTCAC TAAAATGCTA ATT AG G C AAA GACAGGAGGT AAAGAAATAG CCAATCATYT 180 

O ATTGCMTGAG AG C AC AG C AG GAGGGACAAY RATCGGGATA TAAACCCARG YHTTCGAGCY 240 

i« GCCAACRGCA GMCCCCCTTT GGGTCCCYTC CCTTTGTATG GGAGCTCTGT TTTCATGCTA 300 

TTTCACTCTA TTAAATCTTG CARCTGCRCT CTTCTGGTCC ATGTTTCTTA CGGCTYGAGC 360 

]=£ TGAGCTTTYG CTCRCCRTCC ACCACTGCTG TTTGCCRCCA CCGCANACCY GCCGCTGACT 420 

flJ CCCATCCCTC TGGATCMTGC AGGGTGTCCG CTGTGCTCCT GATCCAGCGA RGCRCCCATT 480 

~ GCCGCTCCCA ATYGGGCTAA AGGCTTGCCA TTGTNCCTGC AYGGCTAAGT GCCTGGGTTY 540 

§""»'• 

RTYCTAATTG AGCTGAACAC TANTCACTGG GTTCCATGGT TCTCTTCTGT GACCCACRGC 600 

TTCTAATAGA RCTATAACAC TYACCRCATG GCCCAAGRTT CCATTCCTTG GAATCCRTRA 660 

RGSCAACGAA CYCCASGTCA GAGAAYACGA RGCTTGCCAC CATCTTGGAA GCGGCCTGCT 720 

ACCATCTTGG AAGTGGTTCA CCACCATCTT GGGAGCTCTG TGAGCAAGGA CCCCCMRGTR 780 

ACA 783 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

2 5 (ii) MOLECULE TYPE: DNA 
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(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

TGTCCCCTCT GCTCCTGATC 20 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 21 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

20 

ATGCACTCTG GCTGGGCCAA T 21 

(2) INFORMATION FOR SEQ ID NO: 18: 

2 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
ACCATTTGAC CCTCAGACAC T 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

AACCCTTTGC CACTACATCA ATTT 

(2) INFORMATION FOR SEQ ID NO : 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

TCAGGGATAG CCCCCATCTA T 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
TTGTCTCCTG GATTTTCAGG TT 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

GGACCCTACC CAGTCATTTT 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleotide 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 



1Q ATCAGGAGCA CAGCGGACAC 20 



(2) INFORMATION FOR SEQ ID NO: 24: 



M (i) SEQUENCE CHARACTERISTICS: 

% 15 (A) LENGTH: 22 base pairs 

HP (B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

m (D) TOPOLOGY: linear 

jdb 2 0 (ii) MOLECULE TYPE: DNA 

fu 

^ (iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

25 

CGACATCCAA AGTGATACAT CC 22 



(2) INFORMATION FOR SEQ ID NO: 25: 



30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: DNA 
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(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

AATGTATGGC CTGAAGTGCA G 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

CTTCCCAGGA TGTATCACTT TG 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

CACTGCAGAA GAATATAAGT CGTT 

(2) INFORMATION FOR SEQ ID NO: 28: 

REPLACEMENT SHEET (RULE 26) 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GCTTCCAAGA TGGTGGCAAG C 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 678 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

TCAGGGATAG CCCCCATCTA TTTGGCCAGG CATTAGCCCA AGACTTCAGC CAGTTCTCAT 60 

ACCTGGATAT TCTTGTCCTT TGGTATGCGG ATGATTTACT TTTAGCCGCC CG TT C AG AAA 120 

CCTTGTGCCA TCAAGCCACC CAAGTGCTCT TAAATTTCCT CGCCACCTGT GGCTACAAGG 1B0 

TTTCCAAACC AAAGG CTCAG CTCTGCTCAC AGCAGAAGGC TATTTACCCT AAATACTTAG 240 

GGCTGAAATT ATCCAAAGGC ACCAGGGCCC TCAGTGAGGA ATGTATCCAG CCTATACTGG 300 

CTTATCCTTA TCCCAAAACC CTAAAACAAC TAAGAAGGTT CCTTGGCATA ATAGGCATAA 360 

CAGGCATAAC AGGTTTCTGC TGAATATGGA TTCCCAAGTA CGGCAAAATA GCCAGACCAT 420 

TATATACACT AATTAAGGAA ACTCAGAAAG CCAATACCCA TTTAGTAAGA TGGACACCTG 480 

AAGCAGAGGC AG CTTTCC AG GCCGTAAAGA ACACCCTAAC CCAAGCCCCA GTGTTAAGCT 540 

TGCCAGCGGG GCAACACTTT TCTTTCTGTG TCACAGAAAA AATAGCAATA GCTNTAGGAG 600 

TCCTTACACA GGTCCGAGGG ACCAGCTTGC AACCCATGCC ATACCTGAGT AAGGAAATTG 660 

ATGTAGTGGC AAAGGGTT 678 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 536 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



CCAATCTCCA TGTTGTATCC CCTTCCCCAA CTAATAAGGA CCCCCCTTTC AACCCAAACA 60 



GTCCAAAAGG ACATAGACAA AGGAGTAAAC AATGAACCAA AGAGTGCCAA TATTCCCTGG 120 



TTATGCACCC TCCAAGCGGT GGGAGAAGAA 
TCTCTCTCAC ACTTGAAGCA AATTAAAATA 
CGCTATATTG ATGTTTTACA AGGATTAGGA 
ATATTACTGC TAAATC AG A C GCTAACCTCA 



TTCGGCCCAG CCAGAGTGCA TGTACCTTTT 180 

GACCTAGGTA AATTCTCAGA TAGCCCTGAT 240 

CAATCCTTTG ATCTGACATG GAGAGATATA 300 

AATGAGAGAA GTGCTGCCAT AACTGG AG CC 360 



CGAGAGTTTG GCAATCTCTG GTATCTCAGT CAGGTCAATG ATAGGATGAC AACGGAGGAA 420 
iE AGAGAACGAT TCCCCACAGG GCAGCAGGCA GTTCCCAGTG TAG CTCCTCA TTGGGACACA 480 

SK5KS 

m 

Q GAATCAGAAC ATGGAGATTG GTCCCGCAGA CATTTAAAG C TTTCCCCGGG TACCGA 536 

» 5 (2) INFORMATION FOR SEQ ID NO: 31: 

M> 
nj 

\2 (i) SEQUENCE CHARACTERISTICS: 

W (A) LENGTH: 5 91 base pairs 

/5 (B) TYPE: nucleotide 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

15 (iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

CCATGGCCAT CTACACTGAA CAAGATTTAT ACAATCATGT CGTACCTAAG CCCCACAACA 60 

AAAGAGTACC CATTCTTCCT TTTGTTATCA GAGCAGGAGT GCTAGGCAGA CTAGGTACTG 120 

GCATTGGCAG TATCACAACC TCTACTCAGT TCTACTACAA ACTATCTCAA GAAATAAATG 180 

GTGACATGGA ACAGGTCACT GACTCCCTGG TCACCTTGCA AGATCAACTT AACTCCCTAG 240 

CAGCAGTAGT CCTTCAAAAT CGAAGAGCTT TAGACTTGCT AACCGCCAAA AGACCGGGAA 300 

CCTGTTTATT TTTAGGAGAA GAACGCTGTT ATTATGTTAA T C AATCC AG A ATTCTCACTG 360 

Q AGAAAGTTAA AGAAATTCGA GATCGAATAC AATGTAGAGC AGAGGAGCTT CAAAACACCG 420 

"C AACGCTGGGG CCTCCTCACC CAATGGATGC CCTGGGTTCT CCCCTTCTTA GGACCTCTAG 480 

fir; 

Q CAGCTCTAAT ATTGTTACTC CTCTTTGGAC CCTGTATCTT TAACCTCCTT GTTAAGTTTG 540 

n I 

TCTCTTCCAG AATTGAAGCT GTftAAGCTAC AGATGGTCTT ACAAATCTAG A 591 

LI 5 (2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 64 base pairs 

(B) TYPE: nucleotide 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
15 (iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

CTAACCTGAG GATCCAGCAG CAGGACTGAG GGTGCCCGGG GCAAGTGCCA GCCCATGCCA 60 

TCACCCTCAG AG CCCCGGGT ATGTTTGACC ATTGACAGCC AGGAAGTTAA CTCTCTCCTG 120 

GACACTGGCG CAGCCTTCTC AGTCTTACTT TCCTGTCCCA GACAATTGTC CTCCAGATCT 180 

GTCACTATCC GAGGGGTCCT AGGACAGCCA GTCACTACAT ACTTCTCTCA GCCACTAAGT 240 

TGTGACTGGG GAACTTTACT CTTTTCACAT GCTTTTCTAA TTATG CCTG A AAGCCCCACT 300 

CCCTTGTTAG GGAGAGACAT TTTAGCAAAA G CAGGGG CCA TTATACACCT GAACAAGCTT 360 

GAAA 364 

5 (2) INFORMATION FOR SEQ ID NO: 33: 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 



(ii) 



MOLECULE TYPE: protein 



15 



(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Met Gly Leu Pro Tyr His lie Phe Leu Cys Ser Val Leu Ser Pro Cys 
15 10 15 

Phe Thr Leu Thr Ala Pro Pro Pro Cys Arg Cys Met Thr Ser Ser Ser 
20 25 30 

Pro His Pro Glu Phe Leu Trp Arg Met Gin Arg Pro Gly Asn lie Asp 
35 40 45 

Ala Pro Ser Tyr Arg Ser Leu Ser Lys Gly Thr Pro Thr Phe Thr Ala 
50 55 60 

His Thr His Met Pro Arg Asn Cys Tyr His Ser Ala Thr Leu Cys Met 
65 70 75 80 

His Ala Asn Thr His Tyr Trp Thr Gly Lys Met lie Asn Pro Ser Cys 
85 90 95 

Pro Gly Gly Leu Gly Val Thr Val Cys Trp Thr Tyr Phe Thr Gin Thr 
100 105 110 

Gly Met Ser Asp Gly Gly Gly Val Gin Asp Gin Ala Arg Glu Lys His 
115 120 125 

Val Lys Glu Val lie Ser Gin Leu Thr Gly Val Hie Gly Thr Ser Ser 
130 135 140 

Pro Tyr Lys Gly Leu Asp Leu Ser- Lys Leu His Glu Thr Leu Arg Thr 
145 150 155 160 
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His Thr Arg Leu Val Ser Leu Phe Asn Thr Thr Leu Thr Cly Leu Hie 
165 170 175 

Glu Val Ser Ala Gin Asn Pro Thr Asn Cys Trp lie Cys Leu Pro Leu 
180 185 190 

Asn Phe Arg Pro Tyr Val Ser lie Pro Val Pro Glu Gin Trp Asn Asn 
195 200 205 

Phe Ser Thr Glu lie Asn Thr Thr Ser Val Leu Val Gly Pro Leu Val 
210 215 220 

Ser Asn Val Glu lie Thr His Thr Ser Asn Leu Thr Cys Val Lys Phe 
225 230 235 240 

Ser Asn Thr Thr Tyr Thr Thr Asn Ser Gin Cys lie Arg Trp Val Thr 
245 250 255 

Pro Pro Thr Gin lie Val Cys Leu Pro Ser Gly lie Phe Phe Val Cys 
260 265 270 

Gly Thr Ser Ala Tyr Arg Cys Leu Asn Gly Ser Ser Glu Ser Met Cys 
275 280 285 

Phe Leu Ser Phe Leu Val Pro Pro Met Thr lie Tyr Thr Glu Gin Asp 
290 295 300 

Leu Tyr Ser Tyr Val lie Ser Lys Pro Arg Asn Lys Arg Val Pro lie 
305 310 315 320 

Leu Pro Phe Val lie Gly Ala Gly Val Leu Gly Ala Leu Gly Thr Gly 
325 330 335 

lie Gly Gly lie Thr Thr Ser Thr Gin Phe Tyr Tyr Lys Leu Ser Gin 
340 345 350 
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Glu Leu Asn Gly Asp Met Glu Arg Val Ala Asp Ser Leu Val Thr Leu 
355 360 365 . 



Gin Asp Gin Leu Asn 
370 

Ala Leu Asp Leu Leu 
385 

Gly Glu Glu Cys Cys 
405 

Lys Val Glu Glu lie 
420 

Arg Asn Thr Gly Pro 
435 

Leu Pro Phe Leu Gly 
450 

Gly Pro Cys lie Phe 
465 

Glu Ala Val Lys Leu 
485 

lie Tyr Arg Arg Pro 
500 

Asn Asp lie Lys Gly 
515 

Leu Leu Arg Pro Asn 
530 



Ser Leu Ala Ala Val Val 
375 

Thr Ala Glu Arg Gly Gly 
390 395 

Tyr Tyr Val Asn Gin Ser 
410 

Pro Asp Arg lie Gin Arg 

425 

Trp Gly Leu Leu Ser Arg 
440 

Pro Leu Ala Ala lie lie 
455 

Asp Leu Leu Val Asn Phe 
470 475 

Gin Met Glu Pro Lys Met 
490 

Leu Asp Arg Pro Ala Ser 
505 

Thr Pro Pro Glu Glu lie 
520 

Ser Ala Gly "Ser Ser 
535 



Leu Arg Asn Arg Arg 
380 

Thr Cys Leu Phe Leu 
400 

Gly He Val Thr Glu 
415 

He Ala Glu Glu Leu 
430 

Trp Met Pro Trp He 
445 

Leu Leu Leu Leu Phe 
460 

Val Ser Ser Arg He 
480 

Gin Ser Lys Thr Lys 
495 

Pro Arg Ser Asp Val 
510 

Ser Ala Ala Gin Pro 
525 
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(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Glu Pro Lys Met Gin Ser Lys Thr Lys lie Tyr Arg Arg Pro Leu 
15 10 15 

Asp Arg Pro Ala Ser Pro Arg Ser Asp Val Asn Asp lie Lys Gly Thr 
20 25 30 

Pro Pro Glu Glu lie Ser Ala Ala Gin Pro Leu Leu Arg Pro Asn Ser 
35 40 45 

Ala Gly Ser Ser 
50 

(2) INFORMATION FOR SEQ ID NO : 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Met Leu Met Thr Ser Lys Ala Pro Leu Leu Arg Lys Ser Gin Leu His 
15 10 15 

Asn Leu Tyr Tyr Ala Pro lie Gin Gin Glu Ala Val Arg Ala Val Val 
20 25 30 

Gly Gin Pro Pro Gin Gin His Leu Gly Phe Pro Val Glu Met Gly Asp 
35 40 45 
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